os ARCHIVE many walt 
| for Sep [469 
RATIONAL MECHANICS 


and 


PANALY SIS 


| Edited by 
C. TRUESDELL 


Volume 5, Number 5 


SPRINGER-VERLAG 
BERLIN-GOTTINGEN - HEIDELBERG 


O| (Postverlagsort Berlin - 20.7. 1960) 


Mechanicam vero duplicem Veteres constituerunt: Rationalem quae per De- 
monstrationes accurate procedit, & Practicam. Ad practicam spectant Artes omnes 
Manuales, a quibus utique Mechanica nomen mutuata est. Cum autem Artifices 
parum accurate operari soleant, fit wt Mechanica omms a Geometria ita distinguatur, 
ut quicquid accuratum sit ad Geometriam referatur, quicquid minus accuratum ad 
Mechanicam. Attamen errores non sunt Artis sed Artificum. Qui minus accurate 
operatur, imperfectior est Mechanicus, & si quis accuratissime operari posset, hic 


foret Mechanicus omnium perfectissimus. NEwToN 


La généralité que j’embrasse, au lieu d’éblouir nos lumieres, nous découvrira 
plutét les véritables loix de la Nature dans tout leur éclat, & on y trouvera des raisons 


encore plus fortes, d’en admirer la beauté & la simplicité. EULER 


Ceux qui aiment l’Analyse, verront avec plaisir la Méchanique en devenir une 


nouvelle branche ... } LAGRANGE 


The ARCHIVE FOR RATIONAL MECHANICS AND ANALYSIS nourishes 
the discipline of mechanics as a deductive, mathematical science in the classical 
tradition and promotes pure analysis, particularly in contexts of application. Its 
‘purpose is to give rapid and full publication to researches of exceptional moment, 
depth, and permanence. 


Each memoir must meet a standard of rigor set by the best work in its field 
Contributions must consist largely in original research; on occasion, an expository 


paper may be invited. 


English, French, German, Italian, and Latin are the languages of the Archive. 
Authors are urged to write clearly and well, avoiding an excessively condensed or 
crabbed style. 


Manuscripts intended for the Archive should be submitted to an appropriate 
member of the Editorial Board. 


The ARCHIVE FOR RATIONAL MECHANICS AND ANALYSIS appears in 
numbers struck off as the material reaches the press; five numbers constitute a 
volume. Subscriptions may be entered through any agent. The price is DM 96.— 
per volume. 


Notice is hereby given that for all articles published exclusive rights in all 
languages and countries rest with Springer-Verlag. Without express permission of 
Springer-Verlag, no reproduction of any kind is allowed. 


For each paper 75 offprints are provided free of charge. 


ueniaiiél 


Measure-theoretic Foundations of Statistical Mechanics 


ROBERT M. Lewis 


Communicated by M. Kac 

Contents 
OCI ee lb Sewas at teles = AE el Jeep eeaius aca dale seats 355 
Oe, ee es 337 
Pe Ee ae Ee ee ne, ee ree ee 366 
RMR MON na Se Sg en, es a Se ele 366 
sae graced tempessiate OP) SPs a i ee 368 
Saree erases Sere bans is Ps ee PS ae Oe 369 
ese ANCA AC AMICOE 5 ie tii alpine: oie fm C8 ae eases Sonos 369 
E. The grand canonical distribution. «2 - - eee 373 
Seat TNAE A 18 SEM CM PMIEN ES a es pe ee ok, Sel Be Aon ie 373 
ees fe eo Turmber Sh SS Sees 28, Pel ees 378 
eR Sie be Sorta e ext es eee As ess poet ice amen at ded 331 

Abstract 


The basic formulas of classical equilibrium statistical mechanics are derived from 
wealknown theorems in measure theory and ergodic theory. The method used is a 
ization of the methods of Kuixcuix and Grap and deals with several, in 
fact a “complete set”, of “invariants” or “integrals of the motion”. Most of the 
results are simple corollaries of Binxuorr’s ergodic theorem, and since time-averages 
are used, the whole approach is characterized by an absence of statistical “ensembles” 
and probability notions. In the course of the development a “generalized temperature” 
is introduced, and a generalization of the second law of thermodynamics is derived. 
Formulas for the “microcanonical’, “canonical”, and “‘grand canonical” distributions 
appear as special cases of the general theory. 


Introduction 

The subject of this paper is the foundations of classical equilibrium statistical 
mechanics. The approach used is similar to that of A. I. KHINCHIN [5] and 
HL Geap [2] and is characterized by an absence of statistical “ensembles”; in 
fact, probability notions play no role in the theory. Our method generalizes 
that of Kuiscuix in two directions. First, following the suggestion of GRAD, 
we consider more than one invariant, or integral of the motion. Second, instead 
of considering transformations of phase space for which Lebesgue measure is 
invariant, we consider measure-preserving transformations of arbitrary measure 
spaces. 

Section 1, the mathematical theory, may be viewed as a small addition to 
measure theory and ergodic theory; in fact, most of the results may be considered 
corollaries of Binxuorr’s ergodic theorem. In the beginning of that section, 
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we use the theory of differentiation of set functions to arrive at a formula which 
is the natural generalization of the “‘microcanonical distribution”. This approach, 
which is very simple, is made possible by the introduction of the notion of a 
“complete set of invariant functions’. In order to treat a system in interaction 
with its environment we introduce a precise definition of ‘“‘weak interaction” ; 
and in deriving the generalized form of the “‘canonical distribution” we use 
Grab’s definition of a “large component” rather than KHINCHIN’s approach 
based on the central limit theorem. In our development, the exponential law 
of the canonical distribution appears as a natural consequence of an attempt 
to extend our results by the use of equivalent measures. 

The motivation of the mathematical theory appears in Section 2a. There 
we discuss physical systems for which an initial value problem is appropriate. 
We consider the “‘state space’ of such a system and introduce on this space 
measures which are preserved by the “‘solution operator’. Our approach leads, 
in Section 2b, to a generalized notion of temperature and, in Section 2c, to 
a generalization of the second law of thermodynamics. In Section 2d, we derive 
the usual results for classical mechanics from our general theory. There (see 
Lemma 6) we provide a proof of certain assertions which are frequently taken 
for granted. Finally, in Section 2e, we derive the “‘grand canonical distribution”’ 
within the framework of the general theory. 


As is already evident, we shall make use of the language and basic theorems 
of abstract measure theory. Hence we begin with a review of a few of the basic 
definitions: 


If "is any space, then (J’, x) is called a measurable space if W is a completely 
additive class of subsets of I’, 7.e., if xf contains the empty set, gy, and is closed 
under complementation and formation of countable unions. The elements of 
are called measurable sets. If (I, #) and (I”, x’) are measurable spaces, then 
a single-valued function / from I" into I” is a measurable function or measurable 
transformation if the inverse image of every set A’C€.A%’ is a set ACW. For 
real-valued functions, J” is the real line and, unless otherwise specified, .°7’ is 
taken to be the class of Borel sets, the smallest completely additive class of sets 
containing all the open sets. If (I, ..) is a measurable space, then (I’, , m) is 
a measure space if mis a measure function, 1.e., if m is a non-negative real-valued 
set function defined on © such that 


m (uA ») =m (A,) 


for all sequences {A,,} of disjoint sets of - and m(p)=0. Every measure space 


is assumed to be o-finite; 7.e., [’is the union of a countable collection of sets of 
finite measure. 


Let ([;, %, m,) be measure spaces for v=1,...,”. Let A, be a subset of 
I, and let y, denote a point (element) of I]. The product set A,X---xA, is the 
set of all ordered m-tuples (y,,..., y,) where y,€ A,. If each A, is a measurable 


set of finite measure, then A, x--- A, is called a rectangle. The smallest com- 
pletely additive class of subsets of I] x --- x I} containing all rectangles A, X--- XA, 
is called the product class and is denoted by %x---x.%,. It can be shown 
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that there exists a unique measure m on 2 X--+ XI, AX--+ x G,) such that 


nN? 


for every rectangle, m(A, x - =I] m,(A,). mis called the product measure 


and is denoted by m,x---xm,. "ne bs BD X+-+X DB, mxX---Xm,) is 
a o-finite measure space. 

If (I, %,m) is a measure space, a measurable transformation T from I” 
into itself is said to be measure-preserving if* m(T-1A)=m/(A) for every A C.%, 
and a real-valued measurable function /(y) defined on I’ is said to be an invariant 
function, or an invariant, if f(T y)=f(y) a.e. (7.e., “almost everywhere’, that is, 
except on a set of m-measure zero). If 7; is a family of measure-preserving 
transformations depending on the parameter ¢, then f is an invariant function 
relative to the family if there exists a set C of measure zero such that for y€ —C, 
/(Z:y)=f(y) for all t. —C denotes the complement of C. 


1. The mathematical theory 


Let (I, Y, m) be a measure space, and let 7; be a family of transformations 
of I’ into itself which are measure-preserving for tC Y, where Y denotes the 
real semi-axis O<t<oo. Let. denote the class of Lebesgue-measurable subsets 
of FY. We shall say that (I, W, m, T;) is a measure-preserving space whenever 
T;.,=T,T, for each ¢ and s in Y and J;y is a measurable function of ¢ and y; 
.e., I; y is a measurable transformation from (7 xI’, Hx) into (I, ¥). 

Our work in this section is based mainly on the following theorem of BIRKHOFF, 
a proof of which is presented in [4]. 


Ergodic Theorem. lf (I’, &, m, T;) is a measure-preserving space and f(y) is 
integrable on (I', %, m), then the limit 


(1) f(y = dnt [rane 


exists a.e. (The integral in (1) is a Lebesgue integral.) /*(y) is integrable and 
invariant relative to the family 7;. If m(J’) is finite then 


(2) S Ply) dm= J ty) am. 


Given a measure-preserving space (J’, Y%, m, T;), a set of real-valued measur- 
able functions y,,..., ¥, will be called a complete set of invariant functions, and 
the vector y(y)={y,,.-., ¥,} will be called a complete invariant vector if each 
of the functions y,(y) is an invariant relative to the family 7;, and if every 
real-valued measurable invariant function z(y) is a measurable function of 
{y,, ++, Ve} a.e. (If R* denotes k-dimensional euclidean space, and B® denotes 
the class of Borel sets in R*, a Borel measurable function is one which is measur- 
able from (R*, B*) to (R}, B!). We require that z(y)=Z[y(y)] a.e., where Z is 
Borel measurable.) For convenience we shall say that (I’,.%,m,T;,y) is a 
complete space whenever (I', %, m, T;) is a measure-preserving space, and y is 
a complete invariant vector. 


* T-14 denotes the set of all elements y such that Ty is an element in the set 4. 
24* 
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If (I, %, m, T,, y) is a complete space and / is integrable, then according to 
the Ergodic Theorem 
(3) f(y) =Fly(y)] ae. 


Thus we can evaluate /* if we can evaluate F. In what follows we shall show 
how this can be done. 

Given any Borel set B in B*, let A=y1(B)={y3 y(y) € B}*. Then AC # 
and for all £0, 7; maps A —C into itself, where C is a set of measure zero. 


Accordingly we may apply the Ergodic Theorem, using as the basic space not Ag. 
but A —C. It follows from (2) that if m(A)<oo 
(4) f ftdm= isle fdm 


A=6 
and since m(C)=0 


(5) J ftdm= f ftdm. 
A A 
Let M be the measure induced in (R*, B*) by y and m, i.e., for any BE BF 
(6) M(B) =m([y>(B)]. 


It can be shown that if G is a Borel measurable function such that G[y(y)] is 
integrable on (I', , m), then G is integrable on (R*, B®, M) and 


(7) pelea \jdm= dal )aM. 


It follows now from (3), a and (7) that for any B¢€ B* such that m[y4(B)]<co 


(8) pea) aM = JFly bei Sih aoe f fdm. 


a y (B) 
Integrability of F follows from that of /*. 


In order to evaluate F we shall apply a generalization of the well-known 
theorem which asserts that the derivative of an indefinite integral is equal a.e. 
to the integrand. We state, without proof, the following lemma, which follows 
easily from the theory of differentiation of set functions with respect to “‘nets’’. 
A full discussion of the theory appears in [6]. 


Lemma 1. Let B* be the class of Borel sets in R* and let (R*, B*, M) be a measure 
space such that M assigns finite values to finite intervals. Let F be integrable on 
(RY, BY, M). Then for almost every x€R* there exists a sequence of half-open 
intervals 


La)—=Wwod<y—xaqi=ty Ve =< ya = 7d; y=1,...,k} 


y= “py 


such that for each 7=1,2,...; ?<O0<d'; lim (d’—c!)=0; Eye) CL: and 
j- oe 
{ FdM 
9 line) eee ia 
0) jRoME ET Gyn et 


* The expression {y3 y(y)€ B} means: The set of all elements y such that y(y) 
is an element of the set B. 
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Now from (8) and (9) we have 
dm 


f 
(10) Bl yieealittig 2 Ee! a.e. 


From (3) we see that /* has the same value for all y for which y(y) has the same 
value. If y“4(x) denotes the set {y5 y(y) =~} it is natural to write /*[y(x)] 
for the value of /*(y) common to all y in y!(x). Summarizing our results, we have: 


Theorem 1. Let (I’, %, m, T,, y) be a complete space such that for every finite 
interval I in R*, m[y1(J)] is finite. Let f be integrable on (I, <,m). Then 


L 
fly) = lim > f f(Ty) at 


exists a.e. There exist sequences c! and d! of points in R® which depend on x, for 
which clh<O0<d, lim (d'—c!)=0, and 
j— oo 
fdm 


if 
(11) f*[y2(x)] =lim {ya d<y(y)—«<a} at Tape. 
i> myc! <y (y)—* Sd" 


Theorem 1 will be applied later to describe the behavior of isolated physical 
systems. Equation (11) is a generalization of what is sometimes called the 
“microcanonical distribution’. We turn now to the derivation of a formula 
appropriate to a physical system in interaction with its environment. 

Let y’={yi,..., ye} and y”={y,..., yg} be vector functions with values 
in R*,. The complete spaces (I’, Y’, m’, T/, y’) and (I, L", m"”, T,”, y’’) will 
be said to be in weak interaction if there exists a complete space (I’, /, m, T;, y) 
where! =I" XI"; J=A' xX DB"; m=m' xm" ;and y(y)=y(¥', 7") ={r >) Vas 
= y'(y')ty'"(y")*. If m[y4(I)] is finite for every finite interval J in R* and 
if f(y’) is integrable on (I”,.@’, m’), then f is integrable on (J’, ., m) and we 
may apply Theorem 1. It follows that 
(12) f*(y) = lim + a Ty) d 

L>o Te, 
exists a.e., and /*[y71(x)] is given by (11). 

We should note that although we have taken / to be a function of y’ alone, 
f* is a function of y=(y’, y’’) In (12), f(Zy) means /(8) where f is the first 
component of 7;y. According to our assumptions, dm=dm'dm" in (11), and 
we may use Fusint’s theorem to carry out the integration with respect to y”. 
The result is 
13) PFLy2(x)] <li LEO" 2 <r Wty" aaah dm’ 

Foe mye <y(y)—* Sa} 


In the applications, the system of interest will be represented by J”, and I” 
will represent the environment, which is assumed to be, in some sense, “‘large’’. 
Before giving a precise definition of “‘large’’, let us note that whenever (1”, 7’, 


* In the applications we shall see that it may be possible to satisfy the equation 
y=y'+y” only in an approximate sense. 
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m', T’, y’) and (I, Z", m", T/, y") are complete spaces in weak interaction, 
it is possible to find new measures #, p’ such that (1", #2, 1, y’) and 
(", J", p", T,’, y") are complete spaces in weak interaction and p> U")= pp" (he 
—1. We shall see shortly how this can be done*. 

Definition. (I"’, #’’, m”, T,’, y") is large compared to (1”, &%’, m’, T,’, y’), 
or in symbols (I"’, #”, m", T,", y")>>(I", B', m', T,, y’) if the two complete 
spaces are in weak interaction and the following condition is satisfied: Whenever 
pb", p’ are measures such that (I”’, 7”, p”, T;", t"), I", Hb TY, y’) are com- 
plete spaces in weak interaction and p'([”)=p"(I"’)=1, there exists a vector 
4 in R* (depending on pf’ and ”) such that for c<O0<d and d—c sufficiently 
small 


(14) g(V) =P fy" a¢<y"(y") + y' — AS ad} =const. 


for almost all values of y’ (i.e., except for y’ in some Borel set B such that 
b'{y' dy'(y’) € B}=0) ak 

Now suppose (I’, ”,m”, T;”, y”)>(I", Z', m’, T/, y'), and p”", p are 
measures with the above properties. Let p=’ x”, and let us use the abbreviated 
notation 


(15) bly (yl =0"' "ad <y'y) +9") —AS5 GF}, 
(16) b=plysd<y(y) -—ASa. 
Then from (13) 
(17) f*[y7(a)] = lim Sty) bly (v’)) 4p" 
joo 2 


If; is sufficiently large, then d’ —c! is as small as we wish and p"[y'(y’) |=)” ae. 
where p” is a constant. Thus 


(18) Si’) b' Ly’) db’ =Sh yao bh" =S tly’) ap’ So" Ly (y')) db’ = Sty’) db"-b 


and 
(19) My (A) =JS tly’) ae’. 


Equation (19) is a simple formula for /*(y), but it holds only for those points 
y such that y(y)=A. In order to obtain a simple formula for f*[y(x)] for 


* In case m’(I”)<oo and m’(I’)<co we need only set p’=m’'/m'(I’), p”= 
TO Wie Ie). 

** This definition is a generalized form of the definition of a “large component”’ 
introduced by Gran (see [2], p. 469). Its physical interpretation is discussed further 
in the next section. GRapb’s approach is certainly much simpler than that of Kuin- 
CHIN [5], who postulates that the environment consists of a large number, n—1, of 
weakly interacting systems more or less similar to the system of interest. In that 
case the quantities p’’ and p appearing in (17) may each be interpreted as the prob- 
ability that the sum of a large number of independent random variables falls within 
a certain interval. If we then choose 4 to be the mean of the distribution of the sum, 
and if we further require that the distributions be continuous (i.e., representable 
by density functions in R*), then we can apply the central limit theorem and obtain 
the result (19) asymptotically for n—> oo. 

In the applications we shall not require that (14) hold exactly; rather we shall 
suppose that there is some parameter, /, which measures how “large” the environment 
is, and (14) will hold asymptotically for /->+oo. Consequently (18), (19) and equations 
obtained from (19) will be viewed as asymptotic results. 
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values of x not equal to 4 we shall introduce new measures. In this connection, 
the following lemma will be used: 


Lemma 2. Let T; be a family of measure-preserving transformations with 
respect to (I’, %,m). Let p(y) be non-negative and integrable on (I, ,m). For 
each ACA, let q(A)=f v(y)dm*. Then T, is a family of measure-preserving 

A 


transformations with respect to (I’, , q) if and only if is an invariant relative 
to the family T,; and (I’, , m). 


Proof. By definition m(7/1A)=m(A) for A C&. 
1. Let » be an invariant: (T;y)=9(y) a.e. Then 
q(Z,* A) me 0D) dm Bap eA EY Aiea 2) dm =q(A). 
Thus 7; preserves q. 
2 lwet I, preserve g: g(1, 1A)—¢(A): Then 


Bolo GzeerG (Alicea, A) —S py) am 
for all A. Hence y(y)=(Z;y) a.e.; t.e., p is an invariant. 


Now suppose (1”, ¥', m’, T/, y’), (L"", LZ”, m", T;’, y”) are complete spaces 
in weak interaction. On each space we define a new measure 


(20) q(A')=So'y")am’, 9A") =f g"(y")dm". 


By the lemma, (1”, x’, q’, T;’, y’) will be a complete space if and only if q’ is 
an invariant with respect to (1”, ’, m’, T;’, y’). Since y’ is a complete invariant 
vector, this will be true if and Sale if y’(y’)=@'[y'(y’)], where ®’ is a Borel- 
measurable function on Be Similarly. (ih. “ig 7,", a) val be a complete 
space if and only if o’(y")=®" [y’"(y")]._ Now let g=q' xq” be the product 
measure on (J’, ¥). For mn rectangle 


q(A’ x A") = q'(A’) q''(A”’) = ews ta) Dah | dm" 
21 ar vA , at tt 
eu =f [Dy] O"'[y"]dm' dm" = f OG [y'] Gly] dm. 
vA Ae WA? Seal” 
It follows that for any A€.%, g(A)= f D'[y’] BD’ [y”’]dm. Now, as before, 
A 


(I’, %,q, T, y) will be a complete space if and only if 
(22) D'[y'| OT y"’) =e Dy] ae Dy’ + att | f 


where again ® is Borel-measurable on R*. (22) is a functional equation for the 
functions ©’, ®’’, ® which has the solutions B’[y']=c'e~*”; BD" [y"J=c"'e“*” ; 
Oyla-ce .€ ~ 7; Where G—= 4,-5.0, 15 an arbitrary vector in R*, c’ and c’’ 


k 
are arbitrary non-negative real numbers, and « - y= > 4;y;, etc. For convenience, 
j=l 


* Then g is a measure on (I’, #). 


362) Rosert M. Lewis: 


we shall refer to measures g and m which are related by equations of the form 
=fce* 7 dm, (ACL), as exponentially equivalent measures. 
A 


Now, let us assume that the complete spaces (1”, ’, m’, T, y’) and (I, 
A", m'', T/', y') are in weak interaction, where m' and m”’ are not assumed 
to be normalizable, i.e., we do not assume that m’(I”) and m’’(I”’) are finite. 
We now consider the class of normalizable measures which are exponentially 
equivalent to m’ and m’’: Let D’ and D” be subsets of R® consisting of all vectors 
a such that 


(23) z'(a) = fe *dm' ~and- 2a) = fe" ” dm” 


are, respectively, finite. We assume that D’>D’ and that D’ is a non-empty 
set. For each « in D’ set 


(24) 2 (ae) = 2'(a) 2°°(a) . 
By Fusini’s theorem 
(25) z(a) = fee Yt dm' dm” =f e-* dm. 


Now for A’€.%' and A” EE.” set 
(26) p(A’) = [2’(a ee Ydm';  p'(A")=[z"" (a)] 4 fewer dm”. 
A” 
Then if p=p x ae 
(27) p(A) = [z(a)] f eer dm 
A 


for every set AC.%. Of course, the measures p’, £’’, # depend parametrically 
on «. 

Now p'(I[”) =p" (Ll) =1, and we may apply (19). We summarize our results 
in the following theorem. 


Theorem2. “Let (1, of", m2", Te, > (PE on" FY VP Let DD rare 
2'(a) be defined as above and let D'’ > D' where D’ is a non- hoes set. Let f (y'\e“* ¥ VY) 
be integrable on (I”, ', m’) for each a in D'. Then 


(28) I*(y yop Lf T,y) d 


exists a.e. with respect to m' xm’, and for each « in D’ there exists a vector A, in 
R* such that 


(29) f(y (A,)] = F(a) aA Siy jeer dm’. 


Theorem 2, and in particular formula (29), is our main mathematical result 
and will form the basis for the physical applications. However, before pro- 
ceeding to these applications we shall take a closer look at some of the mathe- 
matical concepts, and derive a corollary to Theorem 2. We begin with some 
definitions. 

Let (R*, B*, M) be a measure space. B* denotes the class of Borel sets in R*. 
Let E be any open half-space in R* such that M(£)=0. Let F be the union 
of all such open half-spaces of measure zero. F is open, and its complement H 


i 
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is closed. Since H is the intersection of all closed half-spaces of ‘full measure” M, 
H will be called the closed convex hull of M. Let H be the complement of the 
closure of F. Then H is the largest open subset of H. H will be called the open 
convex hull of M. 


Lemma 3. If M(R*)<oo and 


ae ee GAVE 
~~ M(R*)? 
then x 1s a point in the closed convex hull H of M. (x denotes a point {x,,..., x,} 


in R*.) 


Proof. Suppose x¢H. Then x€F; hence CE where E is an open half- 
space of measure zero. By a rotation of coordinates we can assume that E is 
the open half-space a< x, where a is a real constant. Thus a<*%,. But 


45a 


Hence x;<a. This is a contradiction. 


We shall make use of this lemma a little later, but now we shall use the 
above definitions to examine more closely the notion of a complete set of in- 
variants. If (I’, W, m, T;, y) isa complete space, the invariant functions y,, ..., V, 
in a sense span the space of all invariant functions. The set ,,..., y, can always 
be enlarged by adding invariant functions and remains a complete set. But 
under what conditions can we reduce the number of functions and still have a 
complete set? The following lemma is an answer to this question. 


Lemma 4. Let (I, %,m, T;, y) be a complete space, where y={y,,..., Vp}. 
Let M be the measure induced in (R*, B*) by y and m, i.e., M(B)=m[y(B)] 


for BC B*, and let H be the open convex hull of M. If H is empty, then there exists 
a complete set of invariants {w,, ..., W,_1}. 


Proof. If H is empty, it is easy to see that there exists a hyperplane L ¢ R* 
of ‘‘full measure’’; 7.e., there exists a set L of the form 


(31) L= {ya 3 aly = 1) =o} 


such that M(—L)=0. The a; and uw; are real constants and at least one @; is 


non-zero. Hence we can define a non-singular matrix (4;;); 7,7=1, ..., 8; where 
@,—4;, 1=1,...,k. Let us introduce-the linear transformation 
k 
(32) Wy = 2s Bin (Ys — Mi) 3 y=1,...,k, 
The w, areinvariants. Now y(L) = {y 3 w, (y) =0};hence y+ (— L) = {yw (y) +0} 


and m[y3(—L)|=M(—L)=0. Thus oe Orate Pit ey is any invariant, 
then z(y)=Z[y(y)]=G[w,, ..., w,] a.e., where Z and G are Borel-measurable 
on R”,, Dhus.2(y) =G[Wy, 5 Weary 0) J [1,-1.5 Mp1] .2€.,, where J is Borel- 
measurable on R*-1. Thus w,, ..., @,_1 is a complete set of invariants. 
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We see from this lemma that if H is empty, we can always obtain a smaller 


set of invariants which is complete. If the new H is empty, we can repeat the 
process, | and so on, until eventually we reach a complete set of invariants for 
which H is non- empty, except in the case where the single function y(y) is a 
complete set of invariants and the corresponding H is empty. In this case the 
closed convex hull H consists of a single point c in R! and y=c a.e. A measure- 
preserving space will be called indecomposable* if the only invariant functions _ 
are those which are constant a.e. Whenever H is non-empty, the complete 
set of invariants y,,..., ¥, Will be said to be non-singular. Thus we have 


Lemma 5. Let (I', %, m, T,, {y,,.--, Vat) be a complete space. Then erther 
(I',. S,m,T;) is indecomposable, or there exists a non-singular complete set of 
invariants {z,...,2;}, where JSR. 

Let us consider, for a moment, the indecomposable case: (I’, Y, m, T;, y) 
is a complete space, and y=const. a.e. If m(I")<oo, we can apply the Ergodic 
Theorem immediately, and from (2) we see that 


(32) *(y) = lim at ft Ty) dt=([m(C 7 [to )dm=const. a.e. 


If the complete spaces (I”, ¥’, m’, T;, y’) and (I”’, LZ”, m”, T,'’, y") are in weak 
interaction, and both y’ and y” are constant a.e., then y=’+ y” is a constant 
ae. If m’(I”’) and m”’(I"’) are finite, then m(I)=m' xm" (I" xI’) is finite, 
and if f(y’) is integrable on (L”, 7’, m’), then 


63) Ay) = Jim + f (Ly) dt = fm) f fy’) dm [m7 [ty am 


We see from (32) and (33) that in the indecomposable case the calculations are 
essentially trivial. We shall not make use of these results in the applications. 
The importance of Lemma 5 is that except for the indecomposable case we 
may always assume that a complete space has a non-singular complete set of in- 
variants. We shall make use of this fact in what follows. 
Let us return now to Theorem 2. The results (28) and (29) hold, of course, 
when /(y’) is the vector function y’(y’). Thus 


34) 0 yy" (Ag) =e) = ’(@)AS y'(y' ene dm! = f y'(y') ap’, 


where #" is defined by (26). Now let M’ be the measure induced in (R*, B*) 
by m’ and y’. Similarly let P’ be the measure induced by #’ and y’. Since M’ 
and P’ have the same sets of measure zero they have the same open and closed 
convex hulls H and H. From (34) 

(35) V'(a) =f wd PS 

Rk 

and P’(R*)=1. It follows from Lemma 3 that y"(a) lies in H. Thus y’(«) is a 
single-valued function from D’ into H. We shall show that this function has 


* This is in agreement with the usual usage. (See e.g. [4], p. 25.) The terms 
“ergodic” and “metrically transitive” are also used. 
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a single-valued inverse on H, provided D’ is an open connected set. The follow- 
ing corollary is a generalization of a theorem of Grap [2], which in turn is a 
generalization of a theorem of KHINCHIN [5]. 


Corollary to the Theorem 2. Let H be the open convex hull of the measure M' 
induced in (R*, B*) by m’ and y'. Let D' be an open connected set. Then the function 
"(a ) defined by (34) has a single-valued inverse on H, 1.€., to any c={q, meee CEy 
mn H, there exists a unique « in D’ such that y'(ax)=c. Thus for y' in H, f= als a 
f(a) is given explicitly by (29), and a(¥’) is defined implicitly by (34) *. 


Proof. If v is any non-zero vector in R*, the equation v - (c—y’)=0 defines 
a hyperplane through c and perpendicular to v. Both open half-spaces bounded 
by this hyperplane must be of positive M’-measure; for if one were not, then 
every neighborhood of c would intersect F, the union of all open half-spaces of 
measure zero, and c would be in the closure of F, whereas it is in the comple- 
mentary set. Now let 


(36) Bee i 2 O)e pe 7) 


(37) Wea ae aa i] [e — yi] ew (c—y’) dm’. 


It follows from the above discussion that 
(38) OO eg (ae ee 


Hence (y;;) is a positive definite matrix and y(«) is a “‘convex” function. 

We shall show that y(«)—>0oo as «oo in any direction from an arbitrary 
point % in D’. Choose any unit vector 0 (a, -o,=1), and set c=a)+ta,, 20. 
As ¢ increases, either (A) « reaches the boundary of D’, or (B) ¢-+oo. In case (A), 
z’(a) becomes unbounded for finite «; hence y(«) becomes unbounded. In case (B), 
since H is open, there exists an e>0 such that the point c — eg, is in H. If we 
consider the hyperplane through that point, a, - (c — ea, — y’) =a: (c— y’) —e=0, 
the half-space «,-(c—¥’)—e>0 must be of positive M’-measure. Therefore 


m' {y' aq -[c — y'(y’)] > e} > 0. 


Thus 
wp (a) = f eleotta)—) dm’ > e%"© sé eta (e-¥ (40° 9" gy! 
(39) desley (¥N>e} 
ees i ene dm’. 


{YD a1: [e— 9 (y’)]>8} 
We see that p(x) =a,e’* where a,>0 and e>0. Therefore p(«)>0o as too. 


It follows that y(«) has a minimum at some point « in D’. Since D’ is con- 
nected, « is unique. The point « is also a minimum for log p=a - c+ log 2’(a). 


Hence, at this teen eat tad Seeeogep ets 
c= [z'(a)] 1 fy'(y') ee dm’ =¥' (a). 


* Note that except for the ees, case, we can assume that A is non- 
empty. This assures us that the corollary is not an empty statement. 
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2. Applications 
A. Initial value problems 


The mathematical theory we have developed will be applied to the study 
of the behavior of a system of the type for which an initial value problem is 
appropriate. This problem is to determine for all times, ¢, the “state” of the 
system when the state at f=0 is specified. The state is usually given by a set 
of numbers or functions. We shall denote the states, whatever they may be, 
by y, and the state space (the set of all states) by I. If we assume the existence 
and uniqueness of the solution of the initial value problem for all #20, then 
we may define a one-parameter family of transformations 7; of J’ into itself 
depending on the non-negative parameter ¢: For any yCJ’ and any t20, Ty 
is the state at time ¢ which is the solution of the initial value problem correspond- 
ing to the state y at t=0. TZ; is sometimes called the solution operator. It is 
immediately apparent that for each non-negative ¢ and s, 7;,,=7,7;. In general 
we are unable actually to solve the initial value problem, but we are interested 
in obtaining information about the system without knowing the transformations 
T, explicitly. 

If we can find a completely additive class of subsets of /’, a measure 
function m, and a complete invariant vector y, such that (/’, Y%, m, T;, y) is a 
complete space and m[y4(1)]. is finite for every finite interval J in R*, then 
we can apply Theorem 1: If f(y) is integrable on (/’, a, m), then the “‘infinite 
time average” 


L 
(40) f(y) = Jim +f f(y) at 


L>co 


exists a.e. and /* is given by the formula (41). If f(y) represents a physically 
observable quantity associated with the system in the state y, we may assume 
L 


that a measurement of f produces a ‘“‘finite time average” + [aa y) dt. Ifa 


large number of repeated measurements of / produce the suits value, we may 
say that the system is in “‘equilibrium with respect to /’, and it is reasonable 
to identify the measured value with the infinite time average (40). /*(y) may 
be called the “equilibrium value of /’’, and (11) provides a formula for this value 
which does not require a knowledge of the solution operator Jj. 

Physical systems which may be described in terms of an initial value problem 
are always isolated, 7.e., the influence of the physical environment on the system 
may be neglected. However, in many situations the system of interest* I” 
interacts with the environment to an extent which may not be ignored. In such 
cases, the solution of the initial value problem no longer describes the behavior 
of the system. However, it may be possible to consider a larger system, con- 
sisting of the system J” plus the environment, for which the initial value problem 
1s appropriate, 7.e., the combined system is isolated. 

If 7’ is the solution operator for the system J” when isolated, we assume 
that we can define a complete space (I”, ./’, m’, T/, y’). Similarly, we assume 


* The symbol J” will be used to stand both for the system of interest and for 
its state space. 
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that the environment is a physical system J’ whose isolated behavior is de- 
scribed by the solution operator 7,’ and that we can define a complete space 
(I"", L", m", T,;’, y’’). We assume that y’ and y’’ both have dimension k. Let 
us suppose, fs the moment, that there is no interaction among the systems. 
Let P=IY XI”, J=S' xB", m=m' xm", and let T, be the natural trans- 
formation of induced by 7; and T,”, 7.e., 


(41) Ly aly yj— ey, Ly’). 
Then it is easy to show that (I, %, m, T;) is a measure-preserving space and 
all the invariants y;, y;'; 7=1,...,; are invariants of J. 


Now suppose there is interaction between J” and J”’. Then J, will no longer 
be given by (41), nor, in general, will 7; be measure-preserving with respect to 
(I’, %, m,) nor will all the y;, y;’ be invariants of T;. However, if the magnitude 
of the physical interaction is small, we may suppose that some of the above 
features are preserved. This leads to the definition of ‘‘weak interaction” given 
in the previous secton. We assume that (I’, W, m, T;, y) is a complete space where 


(42) V={Vy co he} = 


Some of the requirements of the definition of weak interaction, in particular (42), 
may be met only in an approximate sense*. 

One further physical condition must be met before we can apply our theory: 
I’ must be “‘large’’ compared to J”. By this we mean that the behavior of 
the system /”’ is not appreciably affected by its interaction with 7”. In particular, 
the value of y=y’’+ y’ is not appreciably affected by fluctuations in the value 
of y’. This condition may be viewed as the physical interpretation of (14), which, 
in the applications, can be expected to hold only in an asymptotic sense. 

Now if 1°’, Ym’, T,’, vy") >> (I", &’, m', T;, y’), we may apply Theorem 2: 
If f(y’) is some physically observable quantity associated with the system J” 
in the state y’, the “infinite time average” 


(43) f* (y) = im rf iLy) 


exists a.e. with respect to m, and (29) provides a simple formula for /* which 
does not require a knowledge of 7;: /*=/(«). 

But presumably we wish to predict the value of /* which we may expect 
to measure in a given situation. For this, we must know the value of «. The 
significance of the corollary to Theorem 2 is that « can be obtained from a 
knowledge of y’, which is the infinite time average of y‘(y’), and thus is presum- 
ably measurable. Knowing y’, we can then predict that a measurement of / 
will produce the value f[«(¥’)]. 

Thus the situation is somewhat analogous to the case of the isolated system. 
There the value of / one would expect to measure was given by /*, and /* was 
uniquely determined by the (invariant) value of y. Here the value of / we expect 
to measure is given by }, and f is uniquely determined by the value of y’. 


* Thus, e.g., there may be some “interaction parameter’, ¢, and (42) may be an 
asymptotic relation valid for ¢ +0. 
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B. The generalized temperature 


Let us suppose that the system of interest, /”, itself consists of a fixed number, 
g, of systems in weak interaction. We suppose that g real physical systems are 
in weak interaction with one another, and the combined system is in weak 
physical interaction with its environment. 

Mathematically, in addition to the hypotheses of Theorem 2, we assume 
that (Lj, A;, m;, Tr, Vq)) is a complete space for 7=1,...,¢; where Voy(¥j) = 
ASya nates Cin el hee) =Tx-+-»xIj, GX XG, mX--+ XM); and 

q 


VS Pa Yj): 
j= 
We note that these assumptions imply that the complete spaces (Jj, %, m,;, 


q 
Tji, qi J=1,---,g are in weak interaction* We also note that D’ SiMe 
where D; is the set of vectors « in R* such that fe~* dm, < ow. 


From the conclusions of Theorem 2, we have 
q 
(44) a(a) = fee Ydm' = JT z(a); a (a) = fe 7 dm,. 
j=l 


For functions /(y,) such that f(y,)e~*'’® is integrable on (Jj, 4, m,) for each 
ain D,, 


F(a) = [2"(a) 17S Fly) eo 4O dam! = (2')> f fee dm, HL 4 
=) (23) St (rr) oF Mdm. 


If we omit subscripts ‘‘4’’ associated with the first component of the system 
of interest, 7.e. with the space J}, we have 


(45) 


(46) 2(a) = fe" dm, 

(47) (a) = [2(a)]4 f f(y) ee 2 dm. 

Note that although we have singled out the system J] for special treatment, 
we could have done the same for any system I}, 7=1, ..., g; with similar results. 


If we also omit the accent ’ in (23) and (29), we see that these equations are 
identical to (46) and (47). Now let us compare the two cases: (I) The system 
represented by J’ is in weak interaction with its environment. (II) The system 
represented by /’is in weak interaction with g—1 other systems, and the com- 
bined system is in weak interaction with its environment. 

We conclude from the above remarks that f is the same in cases (1) and (II) 
provided only that « is the same. Furthermore, suppose we now measure the 
quantity 7, (y;) in system I;, for 7=1,...,¢g. We would expect in each case to 
obtain a value /;(%) where « is the same for every 7, since we have seen that « 
is uniquely determined by y’. 

Up to now, we have viewed the parameter « merely as a mathematical 
convenience. In fact we went to great lengths to show that f could be expressed 
as a function of y’ rather than as a function of «. However, we see now that 


* The definition of ‘weak interaction” for more than two complete spaces is an 
obvious extension of our earlier definition. 
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a may be viewed as a parameter of the system which is the same for systems in weak 
interaction with one another. From this point of view, « takes on physical signi- 
ficance, and its behavior is very much like a temperature. For this reason, we 
shall refer to « as the generalized temperature of the system. 


C. The generalized ‘‘Second Law’’ 


Let B=(f;, ..., B,) be a vector in R’, and let us assume that y’ and 7,’ depend 
parametrically on £. In addition, we assume that, for each fixed value of f in 
some region I, of R’, all the hypotheses of Theorem 2 hold; hence all the con- 
clusions also hold. Then our main results are summarized in the formulas (we 
omit the accent) 


(48) z=2z(a;B)= fe" 9A dm, 
(49) f= Flo; B) == J f(y) e-* 7) am. 


If we assume that we can differentiate formally, we have 


(50) d(log 2) =— 27 fe we 1D y, da, ue 2% Br? a mM, 
(51) d (log z) = — Did pp? at 


Let us define 
(52) q=4(«,B,y)=ze%?; H=H(a,B)=J qloggdm. 


Then f gdm=1 and log q+ logz+a-y=0. If we ae this equation by q 
and integrate with respect to m over J’ we obtain 


(53) H +logz+a-¥=0. 
Differentiating (53) we have dH +d (log z) a d(¥;) )+ Laas = 0. Combin- 


ing this with (51) yields 
(54) — dH = ¥'a,4(9;) — > 2° (BE) ab. 


This equation will be called the generalized second law because of its similarity 
to the second law of Thermodynamics. Indeed we shall compare it directly with 
that law shortly. For later reference we shall also need the following equations 
which are obtained from (48) by differentiation: 


0 
(55) Jy = — = log, 


(56) Yasar) = ay 5 log z. 


D. Classical mechanics 
A mechanical system with s degrees of freedom may be described in terms 
of generalized coordinates 9g,,...,¢, and conjugate momenta #,,...,, which 
are solutions of HAMILTON’s equations 
dq,  0E UPN LEE EN OES 
(57) Gia apy? at aq; ’ Ge! 
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Here E=E(q,,.--,9;, Pir +++» Ps» #) is the total energy or ‘“‘Hamiltonian”’ of the 
system. We shall denote the 2s coordinates and momenta by a single vector 
O= (01, ---) Oxst= {Gay +--+ Is» Pr +--+» Ps} Which may be thought of as a point in 
2s-dimensional euclidean space or ‘‘phase space’, P. We shall assume that the 
mechanical system is ‘‘conservative’, 1.e., E is independent of ¢, and that E 
depends on 7 parameters f,,.-.,f,. Thus E=E(o;8) where B= Bie 
and f varies in some region I, of R’. We assume that as a function of (@, B) 
defined in Px JI,, E has the necessary properties such that the initial value 
problem defined by (57) and 


(58) e=e at t=0 


has a unique solution g(t; 0°, 8) which has continuous first derivatives with 
respect to all its arguments in R!x PxJ,. (Here R1is the set —co<t<oo.) 
If we define the solution operator 7; by 


(59) TL, 0° =VOsorr py, re coeererect) oP. re 
then for each fixed ¢ and £, 7; is a continuously differentiable transformation 


of phase space into itself. 


Now for any real s, the two functions 0 (¢; 7, 0°, B) and o(¢+s; 0°, B) are both 
equal to 7, 0° at T=0. It follows from the uniqueness assertion that the two 
functions are equal and hence that 


(60) OR ad Pil for all real ¢ and s. 


In particular 7_,7;=7)=J, where J is the identity transformation. Thus for 
every t, J; has an inverse, and 7J;-'= T_,. 
It follows easily from (57) that 


d dE dq; , @E ap; 
ae BP) P= Dae cae vagy ae 


(ony Lies ab. *¥ aB gE 


Ogi OP; Op; Ogi 
Thus E[g(¢); B)=E[e (0); B}, or 
(62) FT, 08: Bl == E | oo >6. for all¢€ R! and @°€ P. 
The following Lemma contains assertions which are needed in order to apply 


our theory to mechanical systems. The proof appears in Appendix 1. 


Lemma 6. Let yw denote Lebesgue measure, and L denote the class of Lebesgue 


measurable sets in phase space P. Let M denote the class of Lebesgue measurable 
sets in R1. Then 


(1) Lhe transformations T, are measure-preserving with respect to (PSone 


(II) Z;@ ts a measurable function of t and 9, ie., T;0 1s a measurable trans- 
formation from (R! x P, Mx L) into (P, L). 


(III) E ts a measurable function with respect to (P, L, 11). 


From (III) and (62) it follows that E 7s an invariant function for each fixed B, 
and from (I), (II) and (60) that (P, Y, u, T)) is a measure-preserving space. 
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We now restrict our considerations to mechanical systems for which the 
function E is, by itself, a complete set of invariants*. Experimental evidence 
indicates that this assertion leads to consistent results for a surprisingly large 
class of systems. Thus (P, #, u, T;, E) is a complete space. For many Hamil- 
tonians of practical interest, it can be verified that for every finite interval, J, 
in R}, 


(63) {E> (1)} =u {93 Elo) CI} <&. 


We restrict consideration to mechanical systems satisfying this condition, also. 


For isolated mechanical systems satisfying the above conditions, we may 
now apply Theorem 1. If f(@) is a “‘phase function”’, 7.e., if f is integrable on 
(P,£#, uw), then the infinite time average /*(9) exists a.e. If the system is in 
equilibrium with respect to /, we may predict that a measurement of / will 
produce the value /*, and we have the following formula: 


2 Peer, Ursteae ae lesa 
Fs O00 G Eo Nh) — 4 Sai} 


{*[E(x)] denotes the value of /*(0) common to all @ for which E(9) =x; c’, d/ are 
sequences of numbers, which depend on x, for which c}<0<d’, lim (d/—c!)=0. 
j—-oo 


Formula (64) is sometimes referred to as the microcanonical distribution. 


If our mechanical system interacts physically with its environment, we 
assume that we may treat the system and énvironment together as a mechanical 
system whose states are points in a phase space P= P’ x P”’, where P’ is the 
phase space of the system of interest and P”’ is the phase space of the environ- 
ment. It is certainly true that the transformations 7; of P determined by 
HAMILTON’S equations and the total energy & are measure-preserving with 
respect to (P, ¥, u), where yw is Lebesgue measure in P, and that E is an in- 
variant of 7;. We again restrict consideration to systems for which E is a complete 
set of invariants. If the magnitude of the physical interaction is small, then E 
will be approximately equal to the sum of the energies of the component systems. 
Thus the condition 


(65) E=E! +E" 


will be satisfied in an asymptotic sense. It is easy to show that (P, #, u)= 
(P’x P”, Z’xF",u' Xu"). Thus we may assume that the complete spaces 
(pee? ul, 2) (Pr . 2 sued, ) are. In-weak interaction; according to 
our definition of that term. For most Hamiltonians E’ of practical interest, 
the set D’ of all real numbers « such that 


(66) z(a) = f ent? dp! < 00, 
Ke 
consists of the set {0<a<oe}, which is an open connected set. Thus, in order 


to apply the conclusions of Theorem 2 and its corollary, we need only assume 
that the environment is “‘large’’ in the sense defined. 


* For a discussion of certain mechanical systems where E alone is not a complete 
set of invariants, see [2]. 
Arch. rational Mech. Anal., Vol. 5 25 
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Let us now examine the consequences of Theorem 2. To conform with general 
usage we shall denote « by #, and shall omit the accent. For #in D= {0<B< oo}, 


(67) 2(0,B) = f ePE@P dy. 
P 
If f(0)e~°” @*) is integrable on (P, ¥, w) for each # in D, then 


(68) j(8,p) == f He) PP OM dy. 


Formulas (67) and (68) are sometimes referred to as the canonical distribution. 


As in Section 2C, we define 


(69) gqarie*”, 

and 

(70) H = f qlog qd. 
Then from (53) . 

(71) H+logz+0E=0. 
From (54) 

(72) —dH=0d(E ieeoalnrs 5) 4B; 


Systems of the type we have been discussing are studied in thermodynamics. 
The main result of this study is the “‘second law of thermodynamics” which 
may be written as 


(73) TAS =dU+ > waB,, 


where g; denotes the “‘generalized force’ corresponding to the “‘external variable” 
8;, T’ denotes absolute temperature, U internal energy, and S entropy. Com- 
paring (72) and (73), we see that these equations are identical if we set 


ay Ns eee SRN IE fo re) ey 
(74) gakT; -kH=S; E=U; (59.) = 4 


The constant k can have any real value. By convention it is fixed at a certain 
value, and this determines a convenient temperature scale. In terms of the 
identities (74), z, which is called the partition function, may be written 

1 . 
(75) arte FP eed gape. 
P 
From (71) we have 


(76) —k Dilog2=U = S§. 


In thermodynamics the quantity U—TS is called the ‘“‘free energy” and is 
denoted by A. Thus 


(77) A=—kTlogz=— [logz, 
From (55) we obtain 


(78) = kT clog z = T a (=), 
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and from (56) 
(79) n= kT = 


g log z = — of 
Bi 0B; ” 

Equations (76), (77), (78), and (79) express the thermodynamic variables 
#;, U, A, and S as functions of T and B which involve only the partition function 
and its logarithmic derivatives. 


E. The grand canonical distribution 


We now consider a mechanical system consisting of 6 different kinds of 
particles. If m; denotes the number of particles of the 7 kind, and s; the number 
of degrees of freedom of the 7 kind of particle, then the states of the system 
can be represented in a phase space with points {9;;,, p;;,} where i=1,..., 6 
labels the kind of particle; 7=1,...,; numbers the particles of that kind; 
and k=1,...,s; determines the coordinates corresponding to each degree of 
freedom. The vector n= {,, ..., 3} determines the ‘‘composition’’ of the system, 
1.e., the number of particles of each kind. 

For a given composition, 2, we have a phase space P, of points ¢,= (9; ;%, Pijx)- 
We assume that as a function of @,,, 8 defined in P, x J,, the Hamiltonian E,, (g,,, B) 
has the necessary properties such that the initial value problem has a unique 
continuously differentiable solution. If 4,, denotes Lebesgue measure, #,, denotes 
the class of Lebesgue measurable sets in P,, and if S,, denotes the solution 
operator for the system, then we know from the discussion of the preceding 
section that (P,, Y,,u,, S,z) 18 a measure-preserving space and £, is an in- 
variant function. As before we shall restrict consideration to systems for which 
Po miseeby itself; ancomplete set ofinvariants. Thus (,, 2,, 14, nz, £,).18 a 
complete space. 

As in the preceding section, we shall assume that our physical system interacts 
with its environment, but the interaction will include not only the energy, but 
the particles themselves. Thus we shall suppose that particles can be exchanged 
between the system and its environment. In such a situation it is no longer 
possible to represent the state of the system by a point in a single phase space 
P,, but rather as'a point in UF,. In this connection, the following Theorem 


will be used. The proof appears in Appendix 2. 


Theorem 3. Let NW be the space of vectors n={ny, ..., Ns} where n; 1s a non- 
negative integer for i=1,...,0. For each nCW, let (Pi, Lys by, Sut» Siny) 0€ @ 
complete space, where 8(n)=8(nyr> «++ Sin)k= Bin) On)» ANA 0, denotes a point in P,. Let 
(80) eNO) A? 

nEN 
Let & be the class of subsets A of I such that 
(81) ANP,€Y, for every ne. 
For AC &, let 
(82) m(A) = Du, (ANF). 
nEN 


For each y=e,, in I’, let 


(83) TY = Sai On- 
BAS 
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Let n(y)=[m, ..., 3] be the vector function defined by 


(84) n(y) =" (On) =, 

and let 

Let y(y) be the (6+k)-component vector function defined by 
(86) y (y) = {n(y), &(y)}- 


Then (I', LZ, m, T,, y) is a complete space. }(y)=f,(@,) 1s a measurable function 
with respect to (I', L) if and only if f,(0,) 1s a measurable function with respect 
o (P,, Y,) for every nEN. If f(y) ts integrable on (I’, L, m) then f,, (On) =f (On) 
is integrable on (P,, Z,, Uy) for each nEN, 


(87) SN (dees Set SiON at Hae 
The nEN FP, 


and the series converges absolutely. Conversely if },,(0,) 1s non-negative and inte- 
grable on (P,, ,,u,) for each nEN, and if the above series converges, then 
f(y)=fn(@,) ts integrable on (I’, J, m) and (87) holds. 

We now apply this theorem setting g,)(0,)=£,(0,) (thus k=1). In order 
to apply Theorem 2 we set (I", LZ’, m’, T/, V)=(l, &, m, T;, y), and assumes 
that the environment can be represented by a similar space (I”’, ””, m’’, T;’, y’’) 
where (I7", SZ”, m"”, T,", y") > (I", BL’, m', T, y’). The vector « will be denoted 
by {A, 0} where A={A,,..., As} and a- y=A-n+0E,,. In stating the results, 
we omit the accent: For {A, #} in some subset D of R®t? (which is assumed to 
be a non-empty, open, connected set) 

(88) (A, 0; 8). Lert tdm= Die 1? fe eth g 

nN PB, 
If f(y)e7*' 7 =F, (0,)€77'" Puen) is integrable on (I’, x, m) for every a= {A, 9} 
in D, then 


(89) H(A, 0, B) — z1ffe*%dm => thy alg deer f (On) e— 9En(2n) aUy,. 
ne P, 


Formulas (88) and (89) are sometimes referred to as the grand canonical distri- 
bution. 


As in Section 2C we define 


(90) —e (ST ay 
(91) H=f qlogqdm. 
Then from (53) 
(92) H+logz+4-7+0E,=0. 
From (54) 
yyy he eRe OE Les 
(93) dH=9dE, | 3p, ) dBi + 2 Aah. 


systems of the type which we have been describing are also studied in thermo- 
dynamics. For this type of system the second law may be written 


(94) TdS=dU +) 2,dB,— > yjdy,. 
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Li; is the “partial free energy” or ‘‘partial thermal potential’ associated with 
the i kind of particle, and y; is the average number of particles of the i kind. 
Comparing (93) and (94), we see that these equations are identical if we set 


a 
a 


2 StL 3 2 oh AS ae 8 — (SE) = 9; 


(95) ; ; 
pe eee ae 
In terms of these identities, z, which is called the grand partition function 


may be written 


(96) tial 2d i5P {> ee [exp |— pp En (iin Piins BS dain aD; jp. 
ty ony MG= = 


n 


From (92) we have 


(97) —kTlogz =U — TS — ¥) 3,. 
From (55) we obtain ; 
é 

—— 2 
(98) Os hl op los 2 
and 

cA F) 
(99) Preah) a, logz, 
and from (56). 

oe F) 
(100) 2; = kT 2B; log z. 


Equations (97), (98), (99), and (100) express the thermodynamic variables 
#;,;, U, S as functions of 7, uw;, and £ which involve only the grand partition 
function and its logarithmic derivatives. 


Appendix 1: Proof of Lemma 6 

(1) The transformations T, are measure-preserving with respect to (P, L, mw). 

To prove (I) it must be shown that for every L€ ¥ and for every ¢, 71 LEY 
and u(Z;+L)=y(L). The proof depends on a famous theorem of of LIOUVILLE *. 

A. For every fixed ¢, the Jacobian of the transformation 7; is everywhere 
equal to 1. 

B. We shall use the following definition of Lebesgue measure**: Let C be 
the class consisting of the empty set gy and the open intervals {a<o<b}= 
(o4;— 0,<),; i=1,..., 2s} in P.. For any.set GEC, let 7(G) denote the 2s- 
dimensional volume of G. For any subset A of P, set 


[e.e) 


(A 1.4) ua(4) = int | 12(G,)9G,€C; UG, >}. 


n=1 
* is a set function defined on the class of all subsets of P and is called Lebesgue 
outer measure. By definition, # is the class of all sets L such that 
(A122) pe AjestR (AND) e(A-— 1) forevery, A cP; 
and y is the restriction of «* to the sets of #. 


* See [7]. 
*k See [6], p. 93. 
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C. Let G be the class of all continuously differentiable transformations of P 
with Jacobian everywhere equal to 1. Let C, be the class of all subsets £, of 
P such that E,=TE for some TCG and some ECC. For every set £, EC, 
let t(E,) be the 2s-dimensional volume of £,. (Then t(E,)=t(E).) For any 
subset A of P, set 


(A 13) pt(d) =int | 37 (E,)9E,€Gs UE, >A}. 


n=1 
We shall show that w*(A) =y*(A). In so doing we shall make use of the follow- 
ing theorem *: 
D. Every open set in P is the union of a countable class of disjoint, half- 
open intervals. 
E. For every set £, in C,, and every €>0, there exists a sequence of sets 
G* such that G*EC, U.G*S Ey, and ae (G*)<1(Ey) +e: 


If F is any half-open interval (a, i let t(f) denote the 2s-dimensional 
2s 


volume of F, 1.e., t(F)= [] (b;—a,). Given 6>0, there exists an open interval 
t=1 2s 


G=(a,c) such that FCG and t(G)= J] (c;—a,)<1t(F)+6. By D, there exists 
i=1 


a sequence of disjoint, half-open intervals F* whose union is E,. For each F* 
choose an open interval G* such that F*cG* and u(G)sr(F)+ =. Then 


UG UR Ss E,, and pa t (G*) See (F*) =¢+7(E)). 
Bisut (A)=4(A) for evyery*A CP: 
Since CC C,, it follows that for every Ac P 
(A 1.4) yt(A) <u*(A). 
Given any sequence of sets E,,€C, such that eee and given any e>0, 
we may, by E, choose sets G2 €C such that for each n U Gn >E,, and dt Meee 


t(E,) +52. Then U G>A and Yt (Gr) SD |r(E eee If 
we renumber the Gy with a single index, we have U G,>A and >) t fads A 
>'t(E,,). Since ¢ was arbitrary, it follows that s 


(A 1.5) by (A) = p*(A). 
G. For every L€F and every t, J7LE€Y and u(T4 Hat Ly: 
For any Ac P 
w(A) = wh(A) =inf | Dr(B,) EEC; UE, > 4} 
r= 


(A 1.6) d int { > ThE \S Pape Ge ote EPs 14} 


n=1 
fy (Lye esate 
™ Sees4], p. 126. 
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Let L be any set in Y. For any set BC P let A=Z{B. Then G1A=B and 
p(B) =u*(T1 A) = *(A) = "(A NL) $A —D) 
=n*(T (A OL) +e*(T(4 — 1] 
= "(TA ANTAL) +4 A — TL) 
=p"(BO TL) +e*(B— TL). 
Hence], “Le F and 
(A.1.8) w(T4L) = p(T L) = eX(L) =p (L). 


(A 4.7) 


II. 7;0 1s a measurable function of t and @, 1.e., T;9 is a measurable trans- 
formation from (Rx P, “x L) into (P, #): 

Given a measure space (I, %, m), let Z be the class of all subsets of sets 
of measure zero. Let © be the class of sets of the form zU A, where z€Z and 
A€x&. Then WV isa completely additive class of sets containing 7 and the 
extension m of m to W defined by 


(A 1.9) m(zU A) =m/(A) 
is a measure. m is called the completion of m. 

Let »v denote Lebesgue measure on (R?,.-@). It can be shown that the Borel 
sets in R* are a subclass of the Lebesgue measurable sets, and the restriction 
of Lebesgue measure to the Borel sets is a measure whose completion is Lebesgue 
measure. Thus if @ denotes the Borel sets in R! and » denotes the restriction 
of » to M, then. @=.M and 7=r. We shall also denote the Borel sets in P by 
GF and the restriction of bu to L by #. As before, L=L and ja 

It can easily be shown that if fis a measurable transformation from (X, %, a) 
into (Y, Z,t) such that o[f+(B)]=0 whenever t(B)=0, then f is a measurable 
transformation from (X,.%,6G) into (Y, &,T). Now f(t,e)=T;o is a continuous 
transformation from R! x P into P. From this it can be shown that / is measur- 
able from (R, xP, Wx Lf, vx) into (P, Y,“). Thus it will follow that f is 
measurable from (R, x P, @x L, vx) into (P, Y, wu) if we can show that 
(A 1.10) yx p[f4(L)]=0 whenever “[L]=0. 

Now if G is any subset of R, xP, the set G,={05(t,e) €G} is a subset of 
P called the section of G determined by t. We shall show that every section [f7(L) ], 
has fi-measure zero. From this it follows* that »x@[f4(L)]=0. Let f,(e)= 
f(t, e) =T, @. We have seen that for each fixed ¢, /, is a continuously differentiable 
transformation of P into itself with Jacobian 1. Now 


(A 1.11) [74 (L)]. = {e 3 f (te) EL} = {03h (0) EL} =f," (). 


If L is any Borel set of measure zero and € is any positive number, there exists 
an open set 0 of volume € containing L. Then /f;*(@) is also of volume € and 
contains f;1(L). Since € was arbitrary, it follows that / {f;* (L)}={[f" (L)]}=0, 
and the proof is complete. 


* See, e.g., [3], Sec. 36, Theorem A. 
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Ill. E is a measurable function with respect to (P, Pps) 
This is a simple consequence of the fact that E is a continuous function 
on P (see [6], p. 146). 


Appendix 2. Proof of Theorem 3 * 
1. J is a completely additive class of subsets of I’. 
pe&#. Let ACW; then (I’— A) OP,=P,—AMP,€ &, for every nE YW ; there- 
fore, (T—A)C€@. Let A,, Ag,... be in & and set A= U Ay. Then AQ P= 
(UA, ‘a P,) € &, for every nC WV; therefore, AC WH. 
2. m is a o-finite measure function defined on 7: 
Let A,, Ag, ... be a sequence of disjoint sets in . Then 


m(UA;) = 2 tn (Yn F,) = 22 tna VF) =) Dd Hn(A, NB) =>) m(A,). 
k ne n k kon 


R 
er Ey= VU Lae i, (La) <o- Lhen P= UUL ag, and m(L,4)=Mn(Lir) <3 
hence m is o-finite. 


3. f(v)=f,(@,) is a measurable function with respect to (/’, #) if and only 
if /,(@,) is a measurable function with respect to (P,,Y%,) for every nC: 
Forany Borelset BC R1,f1(B) NP, = {ya (vy) EB} N P= {on Stn (On) EBS =p (B)- 
Hence /1(B) €.# if and only if f;1(B) € Y, for every nEY. 
P 


4. Let f(y) =>) a,C.4,(y) be a simple function**. Then f,, (0,) =/(@,) is simple 
k=l 


for each nEN. If f(y) is integrable on (I, , m), then /,,(0,) is integrable on 
(Pe, ,) LOY CAC 1 © 


(A 2.4) J t(y) dm 7 2, Z| he (On) Aly, ? 


and the series converges. Conversely, if /,(@,) 1s integrable on (P,, Y,, u,,) for 
each »€ W and the series converges, then f(y) is integrable on (I, x, m): 


(A 2.2) Jt”) dm = 2 a,m(A,) = pa 4 2 Mn (Ag B) = 2 crs af C4, (On) UMn 
= 2 I tn (Qn) dn. 
Assertion 4, now follows from (A 2.2) and the definition of integrability for 


simple functions. 


5. Let f(y) be non-negative and measurable. Then /,,(0,)=/(0,) is non- 
negative and measurable for each n€.W. If f(y) is integrable, then f,(0,) is 
integrable for each »€.W, (A 2.4) is valid, and the series converges: 


If / is integrable, then, by definition, there exists a non-decreasing sequence 
f,, fe, ... of non-negative simple functions, each integrable on (I’, ,m) such 


* The definition of the integral and other measure-theoretic concepts used here 
are based on [6]. 


** C4,(y) denotes the characteristic function of the set A,,. For the definition of 
a simple function see [6]. 
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that, for each y€TI, jim fe(vy)=H(y), and such that lim f f,dm<oo. By 
definition cei cies, 


(A 2.3) J ily) dm = im f(y) dm 


Set) Ge. = J tl (0,)d@u,. Then Cr+, »=C,,- In the following argument we use 


the fact aah a convergent series of non-negative terms may be rearranged at will. 


[o.e) 


S tly) dm = jim nz Con = ME 2 Cea > Cr 
ie eet ay A NEN 


k=11"Ey 
as 2 > (CMa tae Cin) =a a 2 (Cetin =m Cy, n) 
= > jim Cy» = Ze uit fs (0n) Ay 


nEW Fr 


nEN Pr 


(A 2.4) 


6. Let 7, (0,) be non-negative and measurable for eachn€ WY. Then f(y) =f, (0,) 
is non-negative and measurable. If /,,(0,) is integrable for each ~€W and the 
series on the right side of (A 2.1) converges, then f(y) is integrable, and (A 2.1) 
is valid: 

It can be shown that* there exists a non-decreasing sequence /,, fy, ... of 
non-negative simple functions such that for every y€J’, lim /,(y)=/(y). Then 
Tim fr (On) = (On) =fn (Gn): Ty me is een phens vias 


Now if the series converges, we may reverse the steps in (A 2.4). By making 
use of Assertion 4, we see that f(y) is integrable and (A 2.1) is valid. 

7. Iff(y) is integrable on (I’, %,m), then f,,(0,)=/(@,) is integrable on 
(P,, Y,,¢,) for each n€.V, equation (A 2.1) holds, and the series converges 
absolutely : 

If { is integrable, then by definition /* and f- are integrable. Hence ff (@,) 
and f;, i are integrable for every nC. VY. By definition, 


nEN Pr nEN Fr 
Since both series converge, the right side of this equation converges absolutely 
and can be rearranged. Thus 


8. 708 is measure-preserving with aie to" ym), for 720: 
For any A€.a%, (T;1 A)NP,=T (ANB) =Si(ANP,) € GY, for every ne. 
Hence 771A EX. 
m(Tz* A) = ¥) Ma [(Trt A) OP] = 2 onl Sai (ANF) 
nENW nEN 


= 2) a, (ANF) =m(A). 
nEN 


(A 2.8) 


* See [6], p. 155. 
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OUT = 1 itor WwaO see 0s 
This follows immediately from the same assertion for the S,,,. 
10. T;y is a measurable function of ¢ and y: 


For each n€.%, S,;0, is a measurable function of ¢ and Q,, 7.¢., S,7Q, 18 a 
measurable transformation from (7xP,, 4x Y,) into (p,, Y,). Here aw 
denotes the class of Lebesgue measurable sets in 7 = {t30S?t}. Thus for any 
L,,€ FZ k(t, On) 3 Snt OnE Ln} CM X L,. Now, ae: ge AES, f(t, y) IT yCAS= 

ay Je, 0,) 31; 0,€AS= U JG, where G,,={(t, On) 3Syt0Qn€ ANP}. For each n, 


C is a set in the Hake Mk. Butye, fen mk MX Mixa, Tous: 


n 


for each n, G,CWxW; hence {(t, y) 3T yECASCAMXA; 1.e., T,;y is a measur- 
able transformation from (.7 xI’, @ x .#) into (I, #). 


11. (I, L, m, T;) is a measure-preserving space. 
This follows at once from 1, 2, 8, 9 and 10. 
12. g(y) is invariant with respect to (I’, W, m, T;): 


For each 120, ¢(Z;y) =&imy(Snt Qn) =8(n) (On) =8(y) except on a set SCP 
such that w,,(SOP,)=0 for each nE.Y. Thus m(S)=0. 


13. n(y) is invariant with respect to (I’, /, m, T;): 
14. y(y)=[n(y), g(y)] 1s a complete invariant vector on (I’, Y, m, T;): 


Let A(y) be any invariant function, 7.e., h(T;y)=A(y) a.e. [m]. For each 
fixed n, h(S,,0,)=h(0,) ae. [u,]. Hence, since g(,) is a complete invariant 
vector on (P,, Z,, fn» Sut), 4(On) =H, [&ny(On)] a-e. [u,], where H, is Borel- 
measurable on R*. Let z=(z,,..., 25) be a point in R®, and set 


“A - Hg) 2 wey 
We shall show that H is Borel-measurable on R**°®. If B is any Borel set in R} 
which does not contain the origin, then 
A1(B) = {(z,g) 3 A(z, g) € Bh = U {( (2,2) 5 A(z, 2)€ By z=} 
(A 2.10) Binal z=n}=U{(z, 2) 5g€H, (B); z= n} 
= UNe=—n} xi FCB. 
The set {z=} is just a point in R® and hence is a Borel set. H,-1(B) is also a Borel 


set in R*. Hence H+(B) is a Borel set in R*+*, Now let By be the (Borel) set 
consisting of the origin in R}, 


(A211) HA(By) = {(2,g)32¢1} UU {(z, 2) 9 H(z, g) =0; z=}. 


The set {(z, g)32¢.MN} is a Borel set in R**+®, and for each n 


{(z, 8) D(z, 2) =0; z=n}= {(z, 2) 5H, (g) =0; z=n} 
={(2ig)38¢H, (Bo); 2— nh (=x BACB) 


(A 2.12) 
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But this is again a Borel set; hence H(B,) is a Borel set in R**+°, and it follows 
that H is Borel-measurable on R**°. Now 


(A 2.13)  h(y) =h(On) = Ay [ny (On) ] = A LM, 8(n) (On) ] = A [n(y), e(y)], 


except on a set such that wu, (AN P,) =0 for eachnEM Hence h(y)=H[n(y), g(y)] 
a.e. [m]. 


15. (I’, @, m, T;, y) is a complete space. 
This follows directly from 11 and 14. 
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On the Minimality of Integrity Bases 
for Symmetric 3 x 3 Matrices 


G. F. SMITH 


Communicated by R. S. RIVLIN 


1. Introduction 

Finite integrity bases for five or fewer symmetric 3 x3 matrices under the 
orthogonal transformation group have been derived by RIVLIN & SPENCER [J], [4]. 
There is no assurance given, however, that smaller sets of invariants will not 
also form integrity bases. In this paper, it will be shown that the integrity bases. 
given in [1], [4] cannot be further reduced and they will be said to form minimal 
bases. In § 2, an expression is derived for FP, ;,_;,, the number of linearly in- 
dependent invariants of degree 7,7,...7;. A quantity of degree 1, 7,...75; is 
one of degree 1,,7%,,...,7; in the components of the matrices A,, A,,..., As 
respectively. The number of invariants 0;,;,.;, of degree 7, 7)...7; which are 
derivable from elements of the minimal basis of degree lower than 1 7)... 75 
may be readily computed, and it is shown in §3 that the number F, ;, i, — Ji, i,...i, 
gives a lower bound for the number of invariants of degree 7, 7,...7; in the 
minimal basis. In § 4, the minimality of the bases given in [1], [4] is established 
by showing that the number of invariants of given degree in these bases coincides 
with the lower bound derived by the methods of § 2 and §3. 


2. The Number of Linearly Independent Invariants 


In this section, an expression for the number P, ;,__;, of linearly independent 
invariants of degree 7,7,...7; will be derived. A function F of the matrices 
A, (=|la{)||) is said to be invariant under the group of proper orthogonal 
transformations {R} if 


Fay) =F(ay)) (47 =4,.-.,3) (2.1) 
where 
a) —1,,1,, a (2.2) 
17 —~ “ip Gq “pq ; 
for all  (=||J;;||) such that ; 


It is convenient to use the notation 


1 11> 2 — 42) as = ay3, (2.4) 
(k) — pk k k 
a= al, a al}. of = al. 
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Then, the invariance requirement (2.1) may be written as 
F(a\") = F(a) (fei 320). (2.5) 
The a\”) corresponding to a given l are of course determined from (2.2) and (2.4) 
and may be presented in the form 
= sae? (0,74 NESE) (2.6) 


where the matrices L (=||L;,||) are readily obtained from (2.2). Thus, the 
transformation matrix L of (2.6) corresponding to a rotation of # degrees about 
the x, axis for which 


4, O, 0) 
t=||0, cos#, sind (2.7) 
0, —sin?, cos? 
is given by 
ile O, O, O, O, O 
0, cos#, sin#?, 0, 0, 0) 
‘jae 0, —sin#?, cos®?, 0, 0, 2 (2.8) 
0, 0, 0, cos? %, 2cos# sin #, sin? } 
0, 0, 0, —cos#sin#, cos?# —sin?#, cos#sin’ 
0, 0, 0, sin? ?, —2cos#sin?, cos? 


Let L” denote the n‘* power of the matrix L and let S, denote the trace of L”. 
Then, from (2.8), 
S,; =2+2cos?+ 2cos 20 
and (2.9) 
S,=2+2cosn8+ 2cos2n#. 


The number B& of linearly independent invariants of degree one in the com- 
ponents as) of a single matrix A, is obtained by averaging oe: {R} the trace 
of the transformation matrix L of the independent components a”. This procedure 
is accomplished by integration over the group manifold and ihe result is given 
by 

2a 


= | Tr L(9) (1 —cos 9) dd (2.10) 
0 


where Tr L(9)=S, is given by (2.9). Details may be found in the standard 
references on group representation theory [2], [3], or in the papers of I. ScHuR [5]. 

In order to. determine the number P, ;,.. ;, of linearly independent invariants 
of degree 7, 7,...%5, it is necessary to average over {R} the trace of the trans- 
formation matrix of the independent quantities 


al al? ... aX) af? af)... ag) ...ap) ap)... apy (2.11) 
where 
Lege Rey on =a a 
and 


A She SS: kSkgS:Shk,,.. AShS Sh, 
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Let the matrix L”! (=LxLx--. x) denote the Kronecker* n power of 
the matrix L and the matrix L”) the symmetrized Kronecker n‘* power of the 
series L. Then, if the transformation properties of each of the sets of quantities 

) (j=1,...,6; kR=1,...,5) are given by the matrix L, the transformation 


(n 


aeitey) of the 6” quancnes an) a)... at, (jus 4-951 In = Inns), are given, by 


; 6+n—1 5 
the matrix L™! and the transformation properties of the | ay quantities 


aye dena oe Ta Ae oO fe he Pen) 


are given by the matrix L”) 
It is readily seen that the transformation matrix for the 


6+ 14,—1\/6-- 4,4 6+1%1;—1 
on ty 15 

quantities (2.11) is given by é ; 

LY) x L® x... x LM) (2.12) 
when the transformation matrix for the quantities a\* iG = yy 61) 
is given by L. The matrix (2.12) is the Kronccleer produats af the matries 
LL, ..., L) which are in turn the symmetrized Kronecker 41, ig, ..., 75 
powers of a matrix L. 


It is easily shown [2] that the trace of the Kronecker product of a number 
of matrices is equal to the product of the traces of the individual matrices. Thus, 


Tr(L™) x Lx 0. x EB) = Tr LY Tr EL | Tr BL, (2.13) 


The trace of the symmetrized Kronecker product L™) is given in terms of the 
traces S,, S,,..., S, of the matrix products L', L?, ..., L” by the relation (see 


[2], p. 106) 
(n) __ 1 Sy, ay Ss; Xe ey Swe, an 
Ink >i a! a! sal 1 | 2 ( n (2.14) 
where the summation is over the set of all positive integers %, %,...,%, such 
that 
2 “+ Nk, =n. 
Thus, Coa Died = +n, =n 
Day Roa ae 
Tr LY) — $ (Si Ss), 
‘Tr L® = (S3+3S, Sean Ga (2.15) 
Tr L = 7 (St+6S?S, 8 SS, Heros at 


(5) 


ae 
Le 
S 
7" 


| os = 


y (Sr +105; S2 + 20ST S3 +15 S, $3430 5; S,+ 20S, S34 24 S,). 


The number of linearly independent invariants of degree 1, 1,... 7, is then given 
by P,i,..;, where 


1 i 4 i 
Pyiguis = sa f Ten )Tr EL)... Tr B® (4— cos 8) dd (2.16) 


* For the definition and properties of the Kronecker products, see [2], Chap. 4. 
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where TrL“), TrL), ..., Tr) are functions of @ which are given by (2.9) 
and (2.14). With (2.14) and (2.15), the quantity TrL® TrL®) ... TrL) is 


expressible as a polynomial in the quantities S,, S,,..., S,. Thus, Panai, may 
be evaluated when the quantities 
22 
tif 1 tj 
ish... sh= 5 [ Si Sh... Sh(1 — cos) dd (2.17) 
0 


have been tabulated. Table 1 gives the values of the sis}... s* which are required 
in this study. 


Table 1 
Sas =D Sos ==4|6 EAC | sss 
1 i 1 St SY 2 S{=272 
Sy—2 POA 3 SiS a0 S355 =14. StSy=42 
| See SAS Sse a4: ge 5 
| | | S{S3=28 


3. The Number of Basis Elements 
In this section, an expression for the lower bound on the number of elements 
of the minimal basis of degree 1, 72... 7; will be derived. Let 


Vi, i,...i, = the number of elements of degree 7,7,...7; in the minimal basis 
UA op Oe 

Bi,i,...4, = the number of elements of degree 1,7,...7; in the basis J,,..., I 
given in [J], [4]. 


i, = the number of linearly independent invariants of degree 1 7,... 75. 


n 


O;; i,...i, = the number of invariants of degree 1, 7,...7; which may be formed 
from invariants of degree lower than 2, 7,...7; in the minimal basis 


j Sears Se 


It is clear that 8;.;,.i,27i,i,.,.i, since the y;,;,..4, represent the number of 
elements of degree 7, 7, ... 7;in the minimal basis. The quantities Pj, i, —Vi,i,... i, 
represent the number of invariants of degree 7,7,...7; in the minimal basis 
provided that the 0;,;,.;, invariants of degree 7, 7, ...7; which are formed from 
the elements of the minimal basis of degree lower than 7, 7, ... 7; are all linearly 
independent. Since this is not necessarily the case, 0, ;,..;, will give the upper 
bound for the number of Jinearly independent invariants of degree 1, 1)... 15 
which may be obtained from the products of elements of degree lower than 
i, 1,...%; in the minimal basis. Consequently, the quantity Fi, ..4,—ijs..4 
represents a lower bound for the number of elements of the minimal basis of 
degree 7, 7,...%;. Thus, 


PED a Cra Vita, Sad (3.1) 


The quantities #;,;,..;, may be computed from the expression 


5 


' — 1) (gs <1) [Pipipectp tp — 1 
cep ar ene lacie eee bp + Mp (3.2) 


Ny Neg Np 
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where the summation is over the distinct sets of positive integers (”,, 1,,J,, -+-» ®,) 


such that ‘ : : 
M4 + Note +--+ Mpty =1, 


My + Nee +o: + Np 1p =p 


My hy + Ng hg t+ +++ + Mp hy =F, 
and where at least one of the inequalities 
Lt * Tal ian, pe.” | ee Ae cee fs 


must be a strict inequality. 


4. Minimality of the Integrity Bases 


In this section, the minimality of the integrity bases derived by RIvLin & 
SPENCER [1], [4] for five or fewer symmetric 3 x3 matrices will be established. 
These integrity bases are given in terms of the quantities (4.1), ..., (4.5) below. 


I. Invariants involving a single matrix: 
Teas, Dae tea (4.1) 
II. Invariants involving two matrices: 
Tr A, A;, Tr A; A}, Tr Aj A;, Tr AZ AG (i <7). (4.2) 
III. Invariants involving three matrices: 


(i) TrA,4;A, (@<j<h). 


(ii) TrA,A,A?, TrA;A? A? (i<j <h) oP 
and invariants obtained from (ii) by permuting the matrices A,, ..., A, cyclically. 
IV. Invariants involving four matrices: 
(i) TrA;4;A,A,, TrA;A;A,A, (*<7<k<l)). 
(ii) TrA;A;A, Aj, TrA,A, A; A} (*<7<k<l) 
and invariants obtained from (ii) by permuting the matrices A;, ..., A; cyclically. 


(iii) TrA;A,;Aj A}, Tr A;A, A? A?, Tr A;A, A}. Ay, Tr A; A, A} A}, 
Tr A; A, A} A}, TrA,A,A} AP (i<j <k<i). 
(iv) Tr A;A,A; A, A} (t<7<k<l) (4.4) 
and invariants obtained from (iv) by permuting the matrices A;, ..., 4; cyclically 
V. Invariants involving five matrices: 
(i) TrA;A,A,A,A,,, TrA,A,A,A,,A,, Tr A; A;A,,A, A), 
Tr A;A,A,A;A,,, TrA,A,A,A,, A), TrA;A,A;A,A,,. 
(il) TrA;A;A,A,Ay,, Tr A,;A,A,A,A®?,, Tr A,A,A, A, Aj, 
Tr A,;A;A,A,Ai,  (t<j<k<l<m) 


(<7 <k<l<m) 


(4.5) 


and invariants obtained from (ii) by permuting the matrices A, wih, cyclically, 
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The minimality of the bases given in [7], [4] will be established by showing 
that the number of elements £;,;,_.;, of degree 7, 7,...7; in these bases coincides 
with the number of elements y,,;,.;, of degree 7,7)... 7; in the minimal bases. 
It is evident that the number of elements y;;,;, of degree 7,7,... 75; in the 
minimal basis is equal to the number of elements y,;j, of degree 7,7... 75 
where 7; 7...75 is any permutation of 7,7) ...7;. Since the number of elements 
of the bases in [Z], [4] have the same property, 7.e. » Bixiy.. b= Pi Ge 108 Tyas 
any permutation of 1, 7,...%5;, 1t will be necessary in the page to establish 
that ;,i,..4,=i,i,..i, for only one of the permutations of 7, 7... 75. 


Single matrix A,. The integrity basis is given by the quantities (4.1) with 
j=, (WS, 
B, = B2=P3=1. 
From (2.16) and Table 1, 


RH1; F=2,) B= 3. 
Then, from (3.1), 
f-O,=15%,56,=15"=fr=1, 


PF, —9,=15 72S f2=1>72=fh2=1, 
P,—%,=15 7,5 63=1>73=83 =1 


where, from (3.2), 

B=, aoe Os == 25 
Since B,,=0 implies that y,=0, it is seen that B;=y,; and hence the quantities 
(4.1) with +=1 form a minimal basis for a single matrix A,. 


Two matrices A,, A,. The integrity basis is given by the quantities (4.1) 
and (4.2) with 7,7=1, 2. It is clear from the previous section that the quantities 
(4.1) with 7=1, 2 belong to the minimal basis and it is only necessary to verify 
that the quantities (4.2) with 7,7=1,2 also belong to the minimal basis. From 


(4.2), 
Bir = Bie = Boo = 1. 
From (2.16) and Table 14, 


Py =2, Pip =4, Pa,» =9. 
Then, from (3.1), 


Py — 9 = 195711) SB = 1 SM = Bi = 1; 
Pe— 912 =15 12S Bi2=1 S12 =Bi2=1, 


Phy — 9a =15 Yoo S Bog =1 S22 = foo = 1 
where, from (3.2), 


Pr=1, e=3, Bo.=8. 


Hence, the quantities (4.1) and (4.2) with 7,7=1,2 form a minimal basis for two 
matrices A,, Ay. 


Three matrices A,, A,, A,. The integrity basis is given by the quantities (4.1) 
(4.2) and (4.3) with 7,7,k=1, 2,3. It is seen from the previous sections that 
the quantities (4.1) and (4.2) with 7,7=1, 2,3 belong to the minimal basis, and it 
is only necessary to verify that the quantities (4.3) with 7,7,k=1, 2,3 belong 
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to the minimal basis. From (4.3), 
Birr = 1, Bi2=1, Bioe=1- 
From (2.16) and Table 1, 
Pyar = 5, Pr2= 11; Pio = 20. 
Then, from (3.1), 
Puy — (i = 13 111 S fi = 1 PM =A = 1, 
Pe P12 = 15112 S Bre = 1 S12 =hi12 = 1; 
Poe — Prog = 15 M122 S Pio2=1 SV122= fio = 1 


where, from (3.2), 

Oit= 4, Via 10s Oia oS 
Hence, the quantities (4.1), (4.2) and (4.3) with 7,7, k=1, 2,3 form a minimal 
basis for three matrices A,,..., Ag. 

Four matrices A,, A,,-A,, Ay. The integrity basis is given by the quantities 
(4.1), (4.2), (4.3) and (4.4) with 7,7, k,/=1, 2,3, 4. It is seen from the previous 
sections that the quantities (4.1), (4.2) and (4.3) with 1,7, k=1, 2,3, 4 belong 
to the minimal basis, and it is only necessary to verify that the quantities (4.4) 
with 7,7, k, J=1, 2,3, 4 belong to the minimal basis. From (4.4), 


By =2, Pi12=2, Bii22=1, Pi1is=1- 
From (2.16) and Table 1, 
Pui = 16, Piie=38, Priv. = 96, Priis= 68. 
Then, from (3.1), 
Posi Aa = 2 S111 S Airs = 2 = %1111 = Pir = 2, 
Pause Arie = 2 S112 S Brie = 2 SM112 = Bi112 = 2, 
Pica — Pyro2 = 1 S122 SPrre2=1 SYi122=fii22=1, 
Puis— 1113 = 1 SV1113S Priis — = 1113 = Airis —="1 
where, from (3.2), 
Piii= 14, Prrr2 = 36, Prrx2=95, P1113 = 67. 


Hence, the quantities (4.1), (4.2), (4.3) and (4.4) with 7,7, k, /=1, 2,3, 4 form a 
minimal basis for four matrices Ay, ..., Ay. 

Five matrices Ay, Ay, Az, Ay, A;. The integrity basis is given by the quantities 
(4.1), (4.2), (4.3), (4.4) and (4.5) with 7,7, k,l, m=14, 2,3, 4,5. It is seen from 
the previous sections that the quantities (4.1), (4.2), (4.3) and (4.4) with i, 7, k, J= 
1, 2,3, 4,5 belong to the minimal basis and it is only necessary to verify that 
the quantities (4.5) with 7,7, k, 1, m=1, 2, 3, 4,5 belong to the minimal basis. 
From (4.5), 

Pri111 = 9, Byiii2= 4. 
From (2.16) and Table 1, 


Puirr= 6295 Pisa = 1575 
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Then, from (3.1), 


Pay An111 = 6S M1111 SPs =6 = 711111 = Bii111 = 9, 
Puasa — A112 = 4211112 SAi1112 =4 > V11112 = Pirie = 4 


where, from (3.2), 
O11111 = 56, O11119 = 153. 


Hence, the quantities (4.1), ..., (4.5) with 7,7, k, 1, m=1, 2, 3, 4, 5 form a minimal 
basis for five matrices A,,..., As. 
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Fourier-Transform Methods 
in the Theory of Approximation 


P.. L. BUTZER 


Communicated by C. MULLER 


§ 1. Introduction and the Transform Method? 


Integral transform methods such as Fourier-transform methods have proven 
to be of great importance in the solution of initial and boundary value problems 
in mathematical physics. The general method is to transform a given (partial) 
differential equation, involving an unknown function, into an equation involving 
the Fourier transform (or some similar transform) of the original function. This 
new equation is then solved, and an appropriate inversion theorem is applied 
to obtain the solution in terms of the original function space. These methods 
have been investigated not only from a formal but also from a rigorous point 
of view by S. BocHNER [4], G. Doetscu [2/], J. L. B. Cooper [18], E. C. Titcx- 
MARSH [33] and many others. 

The object of this paper is to study such Fourier-transform methods in 
connection with a class of problems in the theory of approximation. This method 
was first considered by the author in [//], and it is the aim here to continue 
these investigations. Some of these and related problems have been investigated 
from an entirely different point of view by E. HItLE ([27], p. 323 and p. 386) 
and the author [8, 9, 10] using a semi-group method. The latter method has 
the same degree of generality as the Fourier-transform method, both possessing 
a general approach and unified theory. It may be remarked that in case the func- 
tions under consideration belong to non-reflexive Banach spaces, e.g. Ly(—©~, oo), 
or the approximating singular integrals do not satisfy the semi-group property, 
the Fourier method is generally the more advantageous of the two. 


We proceed with an outline of the detailed results and the transform method. 
Let us denote the Fourier transform of the function f€ L,(— oo, o) by 


(1.1) Si=f, feo) = (12x) J ix) eras, 
and the Fourier-Stieltjes transform of g by 
(1.2 SSE) =¥, Fl) = (12x) J e-**de (a), 


—e,°} 


g being of bounded variation on (— ov, ov), 


‘ The results of this paper were presented at a meeting of the DMV in Miinster, 
October 20, 1959. A summary of these results appeared in the Comptes rendus de 
l’Académie des Sciences (Paris); see [12]. 
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Given a general singular integral 


(13) h(t) == | fet) Ren) au 


with parameter g>0 and kernel k(u) (see §3 for properties of k), the problem 
is to consider the convergence of J,() towards f(x) for large 9 and to characterize 
the class of functions 7 for which the degree of approximation of f by j,(x) in 
the L,(— 0, oo) norm is exactly of order O(@~”), yO, #.e. for which 


(1.4) DID NO) SSA =Oles ss, 2}: 


The Fourier-transform method in handling problems of this type may be 
formally explained as follows. Let us suppose that the degree of approximation 
of f by J,(x) is given by 

DoJif)=o(e”), eter. 


An application of Fourier transforms to the latter equation gives 


l(o) [4 — &(o/e)]] < (1 iN) J Ile (x)]e-#*|dx=0(0~”), 


(1.5) Jim 9? /(e) [4 — k(v/0)] =O, gyior-all 


For certain kernels of frequent use in mathematical physics (see §3), the trans- 
formed equation 


clo|” f(v) =0, for all v, 


for constant c>0, results. Then a suitable inversion theorem is applied, yielding 
the solution /=0 a.e. in (— ©, oo). 

In the case 
(1.6) D (Jf) =O(e)s 
this equation is transformed via Fourier transforms (for suitable kernels) into 
(1.7) clv|”f(v) = (v), for all », 


where g is of bounded variation on (— oo, oo). In terms of the original functions, 
the solution to our problem then is that f belongs to a certain class K. In the case 
y=2, the class K is the class of functions for which the derivative /'(x) is of 
bounded variation on (— o9, oo). 

Problems of this type are classified under the inverse theorems in the theory 
of approximation. The corresponding direct problem is the following: Given an 
integral of type (1.3) with an appropriate kernel and given that / belongs to 
a certain class K such that a condition of type (1.7) is satisfied, then one must 
prove that (1.6) holds. To solve this latter problem, the transform method may 
again be applied. 

The problems formulated above must be approached from a rigorous point 
of view. This gives rise to the following definition: 
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Let y(o) be a positive non-increasing function of @ such that w(e) | 0 as 
0 + co. If there exists a class K of functions f such that 


(i) D,J;f) =o(p(g)) implies that /(x) is constant, 
(ii) D,(J;f) =O(v(Q)) implies that fe K, 
(iii) for any f¢ K one has D, (J; f) =O (¢(0)), 


then the singular integral J,(x) is said to be saturated with order O(p(e)), and 
K is called the saturation class of the given J, (x). 

This concept was first introduced into the theory of approximation by 
J. Favarp [23] and formulated for semi-group operators by the writer [8]. It 
enables one to restate the above problems in accurate terms. 

The method of approach broadly outlined above is not restricted to (non- 
periodic) functions of class L(—oo, o). In the case of periodic functions f, 
having period 27, one may consider integrals of the form 


and in discussing problems of the type under consideration, instead of applying 
Fourier transforms, one may employ the Fourier coefficients as a suitable trans- 
form: 


A 


fe (n) = (1/22) fifty et done 202 al ae 


This general method is implicitly contained in [1/], especially Theorems 5.1 
and 6.3, and is being investigated at present by G. SuNoucui & C. WaTarI [32]. 
But here it must be pointed out that the saturation classes for various special 
singular integrals of functions f€C[—a, +a] or L,(—2z, +a), which can be 
interpreted as methods of summation of Fourier series of /, have been recently 
investigated by various authors. This is the case with the Cesaro (C, k), Riesz 
R(A, k), Holder (H, k), Abel and Lambert methods of summation, which have 
been treated by S. ALyanéré [2], H. BUCHWALTER [7], J. Favarp [24], A. H. 
TURECKII [34] and M. Zamansky [35]. But the methods of proof in these papers 
are often restricted to one special singular integral and cannot be generalized 
to a wider class of such integrals. 

In case the functions f are non-periodic and defined over a fixed finite interval 
(which by a linear transformation may be reduced to the interval lhe ee cs 3) 
the role played above by the Fourier transform may, for integrals of the type 
(1.3) but defined over [—1, +1], be taken over by the finite Legendre transform 


where L,,(x) denotes the Legendre polynomial of degree 7. 


In the resolution of boundary value problems by operational methods in the 
case when the variables range over a finite interval, G. Dortscn [20] first 
suggested the use of finite sine or cosine transforms. These methods have been 
extended by I. N. SNeppon [37] and especially by R. V. CHURCHILL [16]. 
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In §2 of the present paper we quote several known results which are some- 
what scattered throughout the literature and are needed in our work. In §3 we 
state the conditions the kernel k in question may have to possess, and we de- 
monstrate two direct approximation theorems, the proofs of which are free of 
transform methods. In §4 two inverse theorems are established, the proofs 
depending upon the Fourier-transform methods. In §5, the Fourier method is 
applied to direct theorems of a type similar to those of §3. There is also a dis- 
cussion on the saturation class of the singular integral (1.3). In § 6 the problems 
treated in the previous sections are extended to L,,(— oo, oo) spaces for 1<p<2, 
which are reflexive. In §7 we discuss applications of the general theorems 
presented to FEJER’s singular integral and that of Porsson-CAucHy as well as 
GaAuss-WEIERSTRASS. In § 8 the saturation classes for PIcARD’s and JACKSON’S 
singular integrals are treated, both integrals not being semi-group operators. 
Finally in §9 there is a brief discussion on the semi-group approach to the 
problem. 

§ 2. Preliminary Theorems 

The following known lemmas, needed in the succeeding sections, will be 
stated for completeness. The first concerns conditions under which a complex- 
valued function /(v) of the real variable v admits a representation as a Fourier- 
Stieltjes integral. 


Lemma 2.1. Let h(v) be integrable over every finite interval. A necessary and 
sufficient condition that h(v) can be represented almost everywhere as 


(2:1) h(o) = (1/12) f e-*** dg(x), 
where g is of bounded variation in (— ov, 00), 18 that 
R 
(2.2) (WV2z) f (1 — 121) e*h(v)dvl| =0(1), 


for all O<R<oo. If h(v) is continuous, the representation (2.1) holds for all real 
v (—w<v<oo). 

This is a particular case of a general result due to H. CRAMER [19] for which 
one may also refer to A. GONZALES DoMINGUEZ [26] or J. L. B. Cooper [J7, 


Theorem 5]. 

Needed also is the following fundamental result on the product of two Fourier- 
Stieltjes transforms, a proof of which may be found in D. V. WIDDER [35], 
p- 2515-2553 

Lemma 2.2. I} g, and g, are of bounded variation on (— 0, x) having Fourter- 
Stieltjes transforms &,(v) and 8(v) respectively, then 


(2.3) &1(¥) &o(v) =P(v)  (—we<v<o) 
and 

(2.4) [Var 9] 0 S&S [Var 81] P00 [War g2]Po0- 
Here 00 


(x) = (1/22) S g(x —u) dge(w) 


—c 
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is the bilateral Stieltjes convolution of g, and g,, and [Var g}"., denotes the total 
variation of g over the interval (— ©, u). 

In the case when f€L,(—00, 0), 1<p<2 a Fourier transform must be 
defined. Put 


(2.5) 7, (v) = (12) F Hx) baie 


then as wo, i (v) converges in L,(— oo, 0), p+q=f9q, to a function }(v) 
(using the same notation as in L,- ie) Sails the Fourier transform of f(x). 
This function is known to satisfy the inequality [33, p. 96]; [37, Vol. II, p. 254] 


2.6 (ae) f Foleaef’s (naa) f snltas” 


This inequality is the extension to Fourier transforms of the classical inequality 
of Hausdorf{-Young in the theory of Fourier series. In the case =q=2, formula 
(2.6), with the inequality replaced by equality, is the classical PARSEVAL-PLAN- 
CHEREL formula. 

The extension of Lemma 2.1 to L,-functions is given by: 

Lemma 2.3. Let h(v) be integrable over every finite interval. A necessary and 
sufficient condition that for almost all v 


h(v ) = Lim. (1/ 3m) Sex Cet os 


where g(x) CL,(— 00, o), 7s that 


(2.7) |one=) fle—E ae ei?*dy 


—R 


Lp 


(9) 
for all O0<R<co. (Here l.i.m. denotes the limit in the mean of order q.) 


A proof of this lemma was kindly communicated to the author by J. L. B. 
CoorPeR, who will publish it. 


Lemma 2.4. If (x) and }'(x) are absolutely continuous and 


64) 7 Oe E Ctca recy ie Ete ps 7 


then 
g(v) =(¢ v)? f (v) , almost all v. 


For this lemma, one may consult BOcHNER & CHANDRASEKHARAN [4, p. 124, 
p. 215] — it is the L,-analogue of Theorem 3 (ii) of [5, p. 8]. 
The converse to this lemma is given by 


Lemma 2.5. If {€L,(—oo, o) and gE L,(—~, «) for 1<p<2 and 
(iv)? f(v) =@(v), almost all v, 


then f(x) and f'(x) are absolutely continuous and rales) Re LG (90 oo). 


d 
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For this result see [8, p. 128—129, p. 215]. 
Lemma 2.2 in L,-space is given by: 


Lemma 2.6. Let g,€1,4(— 00, 00) having Fourier transform %,(v) and gC 
L,(—o, ~), 1<pX2 having Fourier transform %,(v) EL, (—~, ~). Then 


(2.8) 81(v) &(v) = (2) 
and 

(2.9) le llcg S Neale, gel,» 
where 


g(x) = (1221) fasts — U) go(u) du. 


For a proof, one may consult TircHMARSH [33, Theorem 65, p. 90 in the case 
p=2 and Theorem 77, p. 106 for 1<p<2]. 


§3. A General Class of Singular Integrals 
Let / be a (non-periodic) function of class L,(— oo, oo), and let 
(3.1) U(x) = (@/V2x) Sf t(wt+uyk(qujdu — (g>0) 


be a general singular integral with kernel k(u) having some or all of the pro- 
perties: 


i) k(u) is a non-negative function of the real variable wu, —o<u<-oo such 
that k € L,(— x, ov); 

ii) k(u) is continuous at w=0 and k(u)=k(— uw); 

iii) (1//22) f k(u)du=1; 
andeither 

iv) w,=(1/V22) fuvk(u)du<+o, y>0 

0 

or the weaker condition 

iv)* there exists a majorant k* of & satisfying iv) 
and either 


v) k(u) is monotone decreasing for 0 S u < 
or 
v)* there exists a k* € L,(— co, oc) such that |k(u)| < k*(u) and &* satisfies v). 
Lemma 3.1. Let fC L,(—~, ~); let k satisfy the conditions i), 11) and iii). 
Then J,(x) as a function in x exists for almost all x and belongs to L1(— 0°, o) and 


6.2) Well, = (N22) SIRO ax If 
The proof is elementary; see also [Z/, §2]. 


Lemma 3.2. Let fC L,(—%, o), let k satisfy the conditions i), ii) and iii). Then 


(3.3) Ilo) —Fll.>9, ote. 
The proof is similar to [33, p. 35]. 


It: 
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We now prove a direct theorem on the degree of approximation of /(x) by 
the J,(x) for large g. It is the L,-analogue of Theorem 2.3 of [/1], the proof 
being entirely independent of Fourier transform theory. This theorem is to be 
compared with Theorem 5.1 of §5 below. 


Theorem 3.1. Assume that fC L,(— 0, 00) and that k satisfies the conditions 
i)—iii) and iv)* and v)* (im particular iv) and v)). Let 


6.4 g (sw) = f(x +m) + fe —») — 2f(2), 
6.5) wo(u) = (1/122) flee w)| ax. 


Then the condition 
h 


(3.6) fw(u)du=o(ht’”), ho 
0 
implies that 


(3.7) IL (#) — F(2)| 


Proof. Since the kernel # satisfies i) and ii), we have 


1,=9(e~”), a too. 


ot {Wht “itll dase f ach flats mlerion an} 


co ) oo 
Si J b*(qu) w(u) du = oh? ( J +f ) teu) wl) du 
=1,+I],, say. 


Defining 


we then integrate J, by parts, proceeding just as in the beginning of the proof 
of Theorem 2.1 of [Z/], and prove that to every e>0, there is a dg such that 


[Zi1<3 e+ y) Vaamu,, 656), all e>0. 
Then one fixes the 6 and considers J,. 
Since {€L,(—oo, 0), w(u) <4 || ||, =M, it follows that 


Zo] SM eh” [ k*(Qu) du 
to) 
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where we have used the fact that w, is finite. Thus 


fit —f|dx <e3 (ity) a, +44 


for 0= @p (6 being fixed). The proof is complete. 


Theorem 3.2. Let f and k satisfy the assumptions of Theorem 3.1 except that 
the condition (3.6) ts replaced by 


(3.8) Fro(u \da=OH 0. 40, 
Then 
(3.9) Il Jo(*) — F(z, =O(07”), eto. 


A slight modification of the proof of the previous theorem gives this result. 

It may be remarked that the small-o as well as the large-O terms in (3.7) 
and (3.9), respectively, are independent of x. 

It is important to note that Theorems 3.1 and 3.2 are of interest only in 
case OSyX2. This is in some sense revealed by the following theorem, the 
method of proof of which is one particular instance of the Fourier-transform 
method explained in §1. 


Theorem 3.3. Let {EC L,(—, o). If 


(3.10) bin 5 = fie LF ( t) — 2f(x)|dx=0, 
then f(x) =0 for almost all x in (— ov, 00). 
Proof. Since 3 [f(x-+t)]=e'*' /(v), we have 


SU (x +8) + F(x — ft) — 2f(x)] = (et + e-#** — 2) f (0) 
= (2 cos vt — 2) f(v). 


Thus according to (3.10) 
|2[cos vt — 1] f(2)| S (12x) Se +4 + f(% —t) — 2f(x)Je**| dx 


o (2?) , t—>0. 
It follows that 
lim — (cos vt — 1) f(v) = 


t—>0 


so that (v?/2!) }(v)=0 for all v, and the uniqueness theorem for Fourier trans- 
forms implies that /(*)=0 for almost all x in (— oo, o). 

The proof reveals that the Fourier-transform method may sometimes be 
successfully applied to problems? which are not exactly of the type discussed 
in §1. 


2 See also ButzER, On some theorems of Hardy, Littlewood and Titchmarsh (to 
appear). 
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§ 4. Inverse Approximation Theorems 
First we prove a converse to Theorem 3.1. 
Theorem 4.1. Let f€ L,(— ve, «); let k satisfy the conditions i)—iii). Suppose 
there exist constants c>0 and, O<yX2 such that 
’ o” | = 
(4.1) lim 72 [1 — h(vfe)] =e > 0 


for every real v, —co<u<oo. Then 


(4.2) I|Jo(*) — F(a) |, =0(e%), (et ) 
implies that f(x) =0 for almost all x im (— 0°, co), 

Proof. We remark that in view of iii), k(0)=1 and |k(v)| <1. Thus the 
constant ¢ in (4.1) is positive. 

We first note that in view of Fuprnr’s theorem it can readily be shown (see 
also [17], §5) that 


SU, (=f) ke) = (— o< <0). 
This gives 


(4.3) SLf(*) — G(*)) =f) 1 —/0)], 
and according to (4.2) 


(1 — ole) Fel S N22) FI) —Lle**]ax= 010%), oho. 
Applying (4.1) shows that 
He [1 — k(v]o)] v) = el o|* fv 


for all real v, and the uniqueness theorem for Fourier transforms gives /(x)=0 
for almost all x in (—oo, oo). This establishes the theorem. 


We note that the method of proof of the latter result is essentially that of 
Theorem 5.2 b) of [//]. The converse to Theorem 3.2 is given by 


Theorem 4.2. Let f and k satisfy the same assumptions as those of Theorem 4.1, 
including the condition (4.1), except that (4.2) is replaced by 


(4.4) I Jo(*) — fl, =O(0) (et ©). 


Then there exists a function g(x) of bownded variation on (—oo, co) such that 
clv |"F (0) (v) for every real v. 


Proof. * consider the partial integrals 


(4.5) Sy (x) = (1/V2m) f [1 — k(v/Q)] flv) ef?* dv, 


which can be written as 


Sw (x) = (1/22) fede (1/22) foe) — J,(u)] ented ys. 


—w —c 
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Since the inner integral converges uniformly when —wXv<-+w, where w is 
arbitrary, the interchange of the order of integration gives 


co 


So (2) = (1/2) f UF (4) —Jy(u)] du fe dv 


(4.6) 
= (tlm) f U4) — Joly] PP a. 


We introduce the (C, 1) means of the partial integrals s,,(x), namely 


R R 


(4.7) oe(x) = (4)R) [ sy(x) dw = (1/V2z) f (1— 121) 1 — Role) fe) eae. 


0 —R 


These Cesaro means og(x) may also be written, upon applying the second repre- 
sentation (4.6), in the form 


on(2) = (1/8) [ {: f tA — foley] SEE") dul dw, 


and inverting the order of integration in the repeated integral, we find that 


on (2) = (Ulm) f [¢(n) — J (0)] REO? a, 


In view of Lemma 2.1 and the hypothesis (4.3), it then follows that 


*) |r, = ||7(#) = J, (7) llc, =O”), 


Il or ( 

yielding (by (4.7)) 
R 
(/V2z) f (1—4EL) (1 —kw/o)] fo) ef av 


IR 


(4.8) 


where the large-O term is independent of R and o>0. 
On the other hand, by (4.1) it follows that there exists a constant M>0 
such that 
|o”[1—R(vJo)]] SM»o|” (eZ @; |v SR), 
and thus 


(1 — 12) or tt — foley] fle) e*] < 20)” F0) 


for 0=6, and |v| SR, every R>0O. Now the right-hand side of the latter in- 
equality is integrable over every finite interval |v|<R, and thus LEBESGUE’s 
dominated convergence theorem yields 


R R 


6 afl Wow toorine tem fle eorineen 
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By Fatou’s lemma it follows that 


if (1//2s) fl = ie lol f(0) sive duldx 
= iim ¢” ies fl ee i) ‘= k(vJo)] /(v) dul dx 


which is finite according to (4.8). Thus 


(1/22) fl Lal) Jol fle) dy 


= 


=0(1), 


I, 


the large O being independent of R. , 
We finally apply Lemma 2.1 to the continuous function h(v)=c|v|’ f(v) and 
deduce that c|v|’ f(v) = g(v), all v. Thus the conclusion of the theorem follows. 
We remark that an entirely different proof of the above theorem will be 
given in [14]. The above method has the advantage that it can readily be ex- 
tended to L,-functions for 1<p<2 (see § 6) and probably also to functions of 
several variables?. 


§ 5. The Fourier Method and Direct Theorems 


This section is devoted to a direct approximation theorem which is essentially 
of the same type as Theorem 3.2. But here the condition (3.8) upon f will be 
replaced by one involving Fourier transforms of the function / and the conditions 
iv)* and v)* of the kernel & in §3 will be replaced by weaker conditions involving 
Fourier transforms. The method of proof of this theorem is also based upon 
the Fourier transform approach of this paper. 


Theorem 5.1. Let fEL,(— x, ©) and assume k satisfies i)—iii) of §3 and 
instead of (4.1) the (apparently) stronger condition 


A 


(5.4) oe” [1—R (v/@) ] 


Saleh — sp(ufo) = (1/22) f ee ay (2), 


= 


where c>0, O<yS2 and w(x) ts of bounded variation on (— oo, co), and satisfy- 
ing y(— cc) =0 and w(cc) = 2a. Then the condition 


c|ol"#e) = 8) (—@<v<e), 
where g(x) 1s of bounded variation on (— oo, c), implies that 
Il Jo(*) — #(+)|1,=0 (07). 


e” [1 — k(v/e)] f(r) = 8 (ve) P(v/e), 
and by Lemma 2.2 it follows that 


Proof. We have 


8 (2) P(v/e) =G_(v) (ee <v<oo), 


3 For a solution to the problem, see ButzEr, On DrricuLet’s problem for the 
half-space and the behavior of its solution on the boundary, Jour. of Mathematical 
Analysis and Applications, 1 (1960) (in press) 
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where 
Bo) = (1102) fe-*** dep (2) 
and 2 


o(*) = (1122) J glx —u) dy (a) 


anywhere we have put y,(u)=y(0u), e>0, —co<u<co. From (2.4) we obtain 


Y 
[Var 9] 00S (4 [V22) [Var g]} 00 [Var p.] P00 
= < (1/22) M,M,=M 


since both g and p are of bounded variation on (— co, oo). Here M, (and thus M) 
is independent of x, for if x ranges through (— oo, oo), so does ox. 


In view of (4.3), we deduce that 
o” SIF(#) — Jy()] = [1 — k/e)] #0) =F (0) = 3S Lge). 


Hence by the uniqueness theorem 


=o S Uw) — hw] dw. 


(5.2) 


The theorem on the variation of an indefinite Lebesgue integral (e.g. [35], p. 20) 
then gives 


(5.3) [Var glen =o" J lil) — hoax. 


But according to (5.2), the total variation of m, over the interval (— oo, oo) is 
uniformly bounded, and thus the integral in (5.3) is also bounded by M. This 
establishes the theorem. 


Remark. If the condition (5.1) is fulfilled, so is (4.1). Indeed, 


lim g” H— Fie) — tim (4jV22) fe -ivsle dy(x) = (1/22) )f dp (x 


et oo c\v|” @ f co 


But whether the condition (5.1) is actually stronger than (4.1) remains unsolved. 
It may well be that both conditions are equivalent. 

The above proof, which replaces an earlier incomplete argument of the author, 
was established in conjunction with HErnz KONIG, who introduced the condition 
(5.4). The author is grateful for these kind suggestions. The metric (5.3) was 
introduced into analysis by S. BOCHNER [4]. 

It would be of interest to express the condition (4.1) directly in terms of pro- 
perties of the kernel & itself. This question is now being investigated by H. KOn1G 
[29], who has just proven the following result: 


Lemma 5.1. Let k satisfy i)—iii) and 0<y<2. Then the condition (4.1) ts 


equivalent to 
=|? in ZY 
te fh u) du =e P(y) sin ae 
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If in addition k satisfies v), then (4.1) ts equivalent to 


lim #2 k(t) = \/2ero +4) sin 2, 


tt co 


A comparison of the hypotheses of Theorems 3.2 and 5.1 is also of interest. 
We first consider the case y=1 and compare the conditions iv) of §3 with (4.1) 


and (5.1). 
If 
(5.4) Quy = (1/V2z) S |x| B(x) dx <0, 
then the existence of 
: [1— (v/@)] a 0 | 4—ei?4le h ped ak 
(5.5) Bae lv ie ee 4 ons ea (x) dx 


follows. But the converse does not hold in general. Indeed, consider FEJER’S 
kernel of § 7; the limit in (5.5) exists as well as (5.1), but the first moment in 
(5 4) does not exist. Thus the assumptions of Theorem 3.2 seem to be stronger 
than those of Theorem 5.1 in the case y=1. 


But in the case y=2, this is not quite so. If the second moment 


2 Ue = (1/V2) J 22 (2) ax 


is finite, then 


(5.6) lim ¢? a =—(1/V2x) [ 22k(x) dx. 


Conversely, if the limit in (5.6) exists, so does the second moment (compare 
[6], p. 71). Thus the essential difference between Theorems 3.2 and 5.1 is that 
the assumptions (3.8) and v) (or v)*) of § 3 must be compared with the 


condition c v? f(v)=¢g(v), for all v, of Theorem 5.1. 


Theorem 5.2. Let 


Tele) == [ fe tw) (qu) du 

be a given singular integral corresponding to a function fC L,(— 0%, «), where the 
kernel k satisfies the conditions i)—iii) of §3 and also the condition (5.1). Then 
the integral J,(x) 1s said to be saturated with order O(o~”), and the saturation class 
as the class of functions f for which 


(5.4) clv|" flo) =8(v) (<0 <0) 


where g ts of bounded variation on (= 00, 00), and c= 0) Oy SS 2, 
In other words, a necessary and sufficient condition that the difference 
D,(J; f) is exactly of order O(0~”) is that the condition (5.4) be satisfied. 


The proof follows immediately from Theorems 4.1, 4.2, 5.4 and the definition 
of a saturation class given in § 1, 
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§ 6. Approximation in the Space L,(— ©, 0), 1<pX<2 


In this section we proceed to extend the foregoing results from L, to L p-space. 
For this purpose we have already introduced the Fourier transform in L, in §2. 


Lemma 6.1. Let {CL,(— 0, 0), p=1; let k satisfy the conditions i), ii) and iii). 
a) Then the integral J,(x) exists for almost all x and belongs to Ly(— ©, 00), and 


Il (*)Ile, = (1/02) Fela)” Site: 


ILe(*) —Fyllzp> > (et ©). 
For the proof one may consult [5, p. 98—101]. 


b) Also 


The extension to L,-functions of Theorems 4.1 and 4.2 is the following. 


Theorem 6.1. Let fC L,(— ~, 0), 1<pS2, let k satisfy the conditions i) —iii), 
and suppose there exist constants c>0, O0<ySX2 satisfying (4.1) for every real v. 


a) If 


(6.1) I2(*) — fl, =e) = (et >), 
then f(x)=0 a.e. in (— ov, ov). 

b) If 
(6.2) IlJo(*) — F(*)|lz, = 9 (0), 


then c|v]”f(v) as the Fourter transform of a function g in L,(—, ov). 
Proof. a) It can readily be shown (compare [37, Vol. II, p. 253]) that 


SLL (#)] =F(v) &(e/e), 
and thus in view of the Hausdorff-Young inequality (see § 2) 
[a 2%) F170) 11 — kofonnteaef"s [(1)V2x) F140) — Leola] 
The hypothesis (6.1) gives 
lim, 0? (a2) J f(o) 1 — R(ofe)]|¢ae} "= 
From (4.1) and Fatou’s lemma it follows that 
[(1/2x) fet |v" o|eav]'"=o, 


and thus c|v|”f(v) =0 for almost all points in (— oo, co 
the uniqueness theorem is needed to show that f(x) =0 


b) We consider the partial integrals 
Sy (x) = aio 1 —k(fo)ledv = (0<w< om), 
The transform of /(v =e'**(|v|<w), 0(|v|>w), is 
x 10% pivUu es 2 sin w(4#-+u) 
= (1//2z) fe CoM dom IE Guay ee 


Arch, rational Mech. Anal., Vol. 5 


). Only application of 
a.e. in (— 00, oo). 


27 
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Hence, by PARSEVAL’s formula [33, p. 70 for P=2, p. 105 for 1<p<2] 


(oe) 


(12x) f fo) 1 — R(ofoy) edo = (alee) f Uf) — (wy) PEE a, 
It follows that 
so (#) = (t/oe) f LA) — Joe] PEE du. 
As in the proof of Theorem 4.2, we now consider 
v(x) = (AIR) f so(x) dw = (1/V2m) f (1—1EE) fo) [1 — Roje)] e* ae. 
0 —R 
The transform of ‘I 
ama(t—wer hi<® 
0 (fae ie) 


1s 


- ete 
mo = (1/2) [ (1g )etretdy = |Z Se 
=e 


Thus again by PARSEVAL’s formula 


22) ) {fe ) [1h (wo)) (1 — here de 


R(x—u)/2 


= (1m) f UF) — ha) ae? ae. 


In view of Lemma 6.1 and the hypothesis (6.2) we have 


lon (*)|Iz, S [lF(%) — (lz, =O (0), 


(1/V2e) f (1— 42!) fe) 1 — belo) ee ae 


—R 


mea (OE 


Ly 


the large-O term being independent of R and o>0. 


We proceed now as in the proof of Theorem 4.2 obtaining (4.9). By Farou’s 
lemma it then follows that 


Kate [l—lN)clopiayensan 


YOan2x) { (1— Efe 1 — Ree) et** ae 


me R 


Lp 


< +o. 
Lp 


Approximation and Fourier Transforms 405 
Finally, applying Lemma 2.3 to the function /(v)=c| v|’f(v), we deduce that 
a (9) a . 
(6.4) c|v|” f(x) =Lim. (1/V2z) f g(x) eW?* dx 
@ 00 = 55 


for almost all v, where g€L,(— ov, 00). This completes the proof. 


Theorem 6.2. Let f€L,(—, ~), 1<p<2, and assume k satisfies i) —ill) of 
§3 and 


(6.5) o” [1k (v/0)] = W(v/@) 


elu” 


I 


(1/V2m) f e*edy(x), 


where c>0, 0<yS2, 


and 


Then the condition 
(6.6) cl ol” f(r) = g (2) 
where g(x) €L,(— 0, oo), wmplies that 


Il Jo(%) — F()IIz, = 9 (07). 
Proof. We have 


g'[1 — A (ole)) Flo) = Pole) @(e) = (1/27) f oalen) edu. g(a. 


By Lemma 2.6 it follows that the product of the two transforms equals 


Go(e) =SlMe(2)], G(x) = Fols x —u)a(ou) du, 


Also 
(6.7) IIPellen = Welle, ¢ lel. 
where we have put «,(u)=a(9 u), 9>0. 
Now 
p(@ *) =e fa (gu) du, 
and thus 


(1/22) [Var y,]=o = @ lll. 
where we have put y,(u)=yp(g 4), @>0. According to (6.7) it then follows that 
Pelle» S (4/V22") [Iellz, [Var Ye]200S My Mz=M, 


since g€L,(— oo, oo) and p is of bounded variation. Also M, is independent 
of @ since @x varies through (— oe, oo) if x does. Now 


0” SLf(%) — J, ()) =e’ 1 — R(v/0)] fe) =, (x) = Sly, (4). 


Dif 
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By the uniqueness theorem for L,-transforms we have 


@ (t(*) — Jo(*)] = Ge (*) 
Since ||y,||z, SV, we find that 
e’ |F(*) —L(#li,S™, 


where M is independent of o, establishing the theorem. 


As an immediate conclusion of these theorems we have 


Theorem 6.3. Let {€L,(—~, &), 1<p<2, and assume k satisfies the con- 
ditions i)—iii) of §3 as well as the condition (6.5) with p(x) and «(x) as defined 
there. , 

a) If ||f(x) —J,(*)|l:,=°9(07”), @ t 00, then f=0 a.e. 

b) The following statements are equivalent for J,(x): 

i) |) — Toll, =e) (et); ; 
ii) there is a g in L,(—o, 00) so that %(v)=c|v|’f(v) for almost all v. 


In § 9 this theorem will be compared with a fundamental theorem on semi- 
groups in the case J,(%) is a semi-group operator. 


§ 7. Applications to Various Singular Integrals 
Consider Fejér’s integral: 


co 


a 1 sin(m+1) u/2 
Ble rercesn eg are 
for f€L,(—oo, oo), and let D,,(o; f) =||o, (x) —f(4)|lz,- 
As a first application of the general theorems established above, we have the 
following 


“Hx 4 u) du 


Theorem 7.1. Let fE€L,(— oo, ). 

a) If D,,(o; f)=o(1/n), n> co, then f is zero almost everywhere in (— x, ov). 

b) If D,(o; f)=O(1/n), then there exists a function g of bounded variation on 
(— 00, 00) such that 
(7.2) [ol fo) =8@) — (— @<v<om). 

c) If the condition (7.2) is satisfied, g being a function of bounded variation 
on (— oo, 00), then D,,(o; f) =O (1/n). 

In other words, FEJER’s integral is saturated with order O(1/n), and its 


saturation class is the class of functions f for which | v| /(v) is the Fourier-Stieltjes 
integral of a function g of bounded variation on (— ov, 0), 


A 


Proof. The kernel k with its Fourier transform k, in case of FEJER’s integral, 
is given by 


hk (u) = 1 (eee y. iw =|" (| 


eal) vl A 
Van \ 4/2 0 (|v| 214). 
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Here g=(n-+1), y=1, and the condition (4.1) takes on the form 


(7.3) tim Ot 2 (= (— 0<v<o) 


nt oo |v] 


with constant ¢ equal to one. Upon applying Theorems 4.1 and 4.2, parts a) 
and b) then follow immediately. 


To prove part c) one must show (5.1) is satisfied. Indeed, according to Fourier 
transform tables (e.g. ERDELYI e¢ al. [22] p. 18 and p. 41) 


Ri 4| for ‘calae 
In #¥ 
\2 ee — Ci(x))} eos (x aed ee n+1 

U 


for rae ei be 
where 
Cif =— [Sa 
is the cosine integral. This gives 
= ie 1 for sat =4 
sin ¥ . —t%—— 
—Ci(x))\ AL dy 
See sag for Sie 
|v] n+1 
Hence the quotient 
1 for |» = 
n+1 i ( v n+ 1 
[| BS: DADS ee 
|v| m+1— ’ 


may be expressed as a Fourier-Stieltjes integral with 


[Eilts —cimyen 


where y(— co) =0 and (co) = )/2z, since 


so that the theorem is established. 


It would naturally be interesting to express the condition (7.2), given in 
terms of Fourier transforms, directly in terms of the original function space. 
This leads to the following result, which I have not been able to locate in the 
literature: 


Conjecture A: a) If fE€L,(—o0e, 00) and if (7.2) ts satisfied for every real v 


where g is of bounded variation on (—oe, 00), then the conjugate function f(x) ts 
of bounded variation on (—o, ~). 


b) If fEL,(—, ) and g(x) = f(x) ts of bounded variation on (— oo, oo), then 


5 (v) =|] f(v) (— o<u<oo). 
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Here 


} (x) = — lim Lf eccaer = Nas 


e|0 a 


Supposing that this conjecture can be verified, it would follow that the 


saturation class of FryéR’s integral is the class of functions f for which f(x) is 
of bounded variation on (— o, ov). 

Let us now proceed to the case f€L,(— oo, 0), 1<p<2. An application 
of Theorems 6.1, 6.2 gives 


Theorem 7.2. Let f be in L,(— ov, ~),1<pSX2. Then the following statements 
ave equivalnt for FEJER’S integral : 


1. Ilo, (*) —f(*)||c, = 9 ( (1/7 1) (n + 0°) ; 
2. theres a gin L ay oo, ph so that ia (v) =|v| i(v ) for almost all v. 


Furthermore, if ||o,,(x) , then f must be zero almost everywhere 
in) (00700); 

We may remark that results analogous to those of Theorems 7.1 and 7.2 
have occured in the literature (e.g. ALExITs [/]) only in the case when f is a 
periodic function of period 2a, fC L,(—2, 2),p21. For the proof of part a) of 
Theorem 7.1 see also [17, § 6]. 

It may be mentioned that the semi-group methods of [8, 9] cannot be applied 


directly to FEJER’s integral, since it is not a semi-group operator. 


The Potsson-Cauchy singular integral is defined by 


2 EE ew) 
p(x, y) a Pt ? 
with 
Pees | “ 
HANS |e ig Arc 


Here e=1/y, y=1, and (4.1) takes on the form 
im ype ri ed 
1 Lee SL a ae Wee aes 
Regarding the condition (5.1), we note that 
1 
a4 —— ele |2| 
ria ] 


may be expressed as a Fourier-Stieltjes integral with 


(1/22) ) fs (*% aa) du, 
where y(— cc) =0 and p(co)=|/2m. This follows from the fact that ([22], p. 18) 


Py ii vet 1—e 7" 
Vel ae Joverger ee ay, 
0 
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From Theorems 4.1, 4.2 and 5.1 then follows readily 


Theorem 7.3. Let fCL,(—o, «), D,(p; f)=||b(x, v) —f(x)||z,.. Then the 
following statements are equivalent for the Poisson-Cauchy integral: 


1. Dy(p,f)=Oly) (yf 9); 

2. There 1s a function g of bounded variation in (— oo, 0) so that g(v) = | v| f(v), 
all v. 

Furthermore, if Dy(p; f)=0(y), y | 0, then f ts zero a.e. in (—oo, ov). 

In the case of L,-functions we have 

Theorem 7.4. If fC L,(—~, ~), 1<pS2, then the Poisson-Cauchy integral 
1s saturated with order O(y), y | 0, and its saturation class 1s the class of functions f 
for which |v| f(v) is the Fourier transform of a function gC L,(—, o). 

The Gauss-Weierstrass singular integral, given by 


has its kernel k with corresponding transform given by 
Ra) —V2e-", k(v) sad aul nk 
Here @ = (2 Jt) +, y=2, and (4.1) is of the form 


1 


Fr ine 1/9 are Woaee  E 


lim 
+10 


Concerning (5.1), we show that 


1 


refi [A a gr] 


can be expressed as a Fourier-Stieltjes transform with 


w(x) =2)2 fle — \nuErfe u) du, 


—co 


where 
Erfe x = (2//a) fe“ du, 


is the complementary error function. This follows from the known formula 
([22], p. 15 and p. 40) 


Si eats 2 = r 1 -to 
\72 fe 2 (e-* — |x x Erfc x)} cos (2 a rea aad tT) (a0): 
0 
We also note that y(— 0) =0, p(o) = 22, the latter following from 


fuErfcudu=0. 


—co 
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Theorem 7.5. Let f€ L,(— 00, 0). Then the following statements are equivalent 
for the Gauss-Weierstrass integral : 

1. (Wes) —fl,=00 (40); ; 

2. there is a g of bounded variation on (— oo, ce) so that g(v) =v? f(v), for all v. 
Furthermore, if \\W(t; x) —f(x)||,,=0(t), then f ts zero a.e. 

In the case of L,-functions, we may even characterize the saturation class 
in terms of the original function-space; thus we have 


Theorem 7.6. If {EL,(— 0, 0), 1<pS2, then the Gauss-Weverstrass singular 
integral is saturated with order O(t),t 0, and its saturation class 1s the class of 
functions f for which f and f' are absolutely continuous and the second derivative 
}''(x) ts in L,(— ©, o). 

Proof. From Theorems 6.1, 6.2 and 6.3 one obtains the saturation class 
expressed in terms of Fourier transforms. Then one makes use of Lemmas 2.4 
and 2.5 to obtain the characterization of the saturation class in terms of the 
original functions. 

Remark. The Poisson-Cauchy integrals as well as the Gauss-Weierstrass 
integrals are semi-group operators. Thus semi-group theory may be applied to 
give entirely different proofs of Theorems 7.3—7.6. In the case of the reflexive 
Banach space L,(— oo, oo), 1<pX2, these have been given by the author 
[8, p. 4214423] and for L,-space by K. DE LEEuw [30, §5,6]. In the case of 
non-reflexive Banach spaces, K. DE LEEUW [30] has presented an elegant gener- 
alization of the semi-group methods of [8, 9]. But the Fourier transform method 
seems to have many advantages, especially in L,-space. It is shorter and more 
direct. 

§ 8. Further Applications 


The object of this section is to present two further singular integrals which 
are not semi-group operators, so that semi-group methods cannot be applied 
directly to them. 


Consider Picarp’s singular integral (see also [11, § 8]) 


Pig st) se a f te + u) elt du, 


with 


k(u) at ee k(v) / a 3 


Theorem 8.1. If {C€L,(— eo, oo), then Picarn’s integral is saturated with order 


O(#?), t | 0 and its saturation class is the class of functions f for which v2 q (v) ts the 
Fourier-Stieltjes transform of a function g of bounded variation on (9) 00) 


Proof. For this singular integral e=4/t, y=2, and (4.1) is of the form 


: 1 4 
ae Dare tae] = Lurene): 


We also note that 


a(t 1 1 
v? eer hs 1+(vt)? 
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may be expressed as a Fourier-Stieltjes integral with 


v(x) = |/% fermau, 


where y(—oo)=0 and p(co)=|/2z. By Theorem 5.2 the proof is complete. 
Conjecture B: a) If fEL,(— ow, 0), g is of bounded variation in (— ov, 00) and 
(iv)2f(v) =g(v), for all v, 
then f'(x) 1s of bounded variation on (— 00, ov). 


b) If fEL,(—, o) and g(x)=}'(x) is of bounded variation on (— oo, oo), 

then J Pw. 
g(v) = (cv)? f(v), for all v. 

Now if this conjecture holds, the saturation class of PICARD’s integral is the 
class of functions f for which f'(x) 1s of bounded variation on (— oo, o). 

For L,-functions we have 

Theorem 8.2. Let fCL,(— ~, ~),1<p2. For Picarv’s integral the follow- 
ing statements are equivalent: 

A. ||P(x; ) — f()ll, =O (8), (¢4 0); 

2. f and f’ are absolutely continuous and f’’ 1s in L,(— ox, ). 

Furthermore, if ||P(x; t) —f(x)||1,=0 (@), then f is zero a.e. 

For the Jackson-de la Vallée Poussin integral corresponding to the function f 


co 


rie 8 
Jon(*) 4a(n+1)3 A 


Theorem 8.3. Assume f€L,(— oe, co). JACKSON’S integral is saturated with 
order O(1/n?), m + 00, and its saturation class consists of those functions f for which 


sin (+1) u/2 . 
u/2 


(x+4u)du, 


we have 


(3/2) v2 f(v) is the Fourier-Stieltjes transform of a function g of bounded variation 
on (— oo, oo). 
Proof. Here e=(n+1), y=2 
1— Fe +3le (|r| S14) 


=e eS sin u/2 \4 ; =, Lio =e eaioieeo. 
Mi— rel |) =) E2—JoP) 2 (1s| 82) 
0 (|o] 2 2), 
ang (m+ 1)? b 
n UV ss 
i gre aye _ Gal = 1 for all v. 
Xe) 
an a for : [S14 
2\n+1|’ es) = 
(n+1)? v oe yee nets 3 = v )} f 1 S| 0) |s2 
(3) v? rq r “al v : al n+1 )p yact | Gece 
U 
setae > 
0 for Pesce 


may be expressed as a Fourier-Stieltjes integral if one applies the result of 
BEURLING [3, p. 349] below. 
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Lemma 8.1. If /(v) is absolutely continuous, and if both h(v) and h'(v) belong 
to L,(— ©, 0), then ms 
h(v) = (1/V22) fe *de(s), 
where g(x) 1s absolutely continuous. 

It is readily seen that the conditions of this lemma are satisfied. By Theorem 
5.2 the result follows. 


According to Conjecture B, the saturation class of JACKSON’s singular integral 
in the space L,(— ©, oo) would be identical to that of Prcarp’s integral. 


For L,-functions we have 

Theorem 8.4. Let f€L,(— 0, ~),1<p<2. For Jacxson’s integral the follow- 
ing are equivalent : 

2. f and f' are absolutely continuous and f' 1s in Ly(—oo, ov). 

Furthermore, if || Jon(x) —f(*)||c,=0 (1/n?), then f is zero a.e. 


§9. A Comparison with the Semi-group Method 


We shall point out very briefly the connection between the Fourier transform 
methods of this paper and the semi-group methods. For this purpose we at 
first need several definitions. 


Let X be a Banach space having elements f with norm ||f||. Let 
S={T(): 050 <9} 
be a family of bounded linear operators on X having the properties: 


a) T(t, ee = T(G) T(C2), T(o) f=f, all fEX; 
b) For 6) f —f\l =o, all: fee 


The ini euimal generator of © is the operator A defined by 
4 f=lim ay 
for all /, for which this limit exists ” the strong topology). The domain of A 
is denoted by D(A). 
Lemma 9.1. Let {ECE X. 


a) If \|T(Q)¢—f||=0(), C0, then Af=0 and T(C)f=f, all CZO0. 


‘ If in addition the Banach space X is reflexive, then the following statements 
ave ie ; 


i) TC) ff] =0 (9), 
ii) f7s im D(A). 
For a proof one may consult [8] or [28, p. 326] and for a further generali- 
zation [15]. Part a) is due to HiLLe [27, p. 321], and for b) see BurzeEr [8]. 


It is this lemma which may be compared with Theorem 6.3. Indeed, the 
singular integral 


(9.1) I() =—& [ fxm) R(gu) du 
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defines a linear operator S(o) depending on the parameter 9>0, mapping the 
functions f in the Banach space L,(— oo, oo), f=1 into itself: S(o)f=f- 


The operator S (9) has the property 


(9.2) [S(e) Sledlf=S(yEt—|f, every /ELp(— o>, ), 


provided the kernel k satisfies the functional equation of CHAPMAN-KOLMOGOROFF 


CO 


(93) £182 f b(oy(x — 1) b (on) du= 1 Pes aoa 
ik 2x Vex +e% Wana) 


If we put o=1/y(6)''”, O0<yS2, then (9.2) assumes the form 


S| : )s{ al j= 
| My Ve, y VC, 
and this equation reveals the semi-group property of the operator 7(¢): T(¢)f= 


S(Q) f. 


For example, if (9.1) is the Gauss-Weierstrass singular integral of § 7, then 
y=2, o=1/2 Vt,¢=t and the equation (9.3) is of the form (compare [27, p. 403 ]) 


1 
Tee) 


ral exp| a") exp/ a) d"= joo exp gpa) 


For an example of a semi-group operator with O<y<2 but y +1, see [29, p. 77]. 
The kernel of this operator is the so-called stable density of PAUL Lkvy. 


For further details in connection with the above we refer to [6, p. 69 etc.], 
[28, p. 566—571] and [25]. 

Lie (0,00) 1p = 2,4 comparison between the two methods is 
perhaps best revealed in the table of formal correspondences below. 


Table 
Sree EE eeae 
C0, ¢€i0 e>0, eoo((4=(y0)’) 
ITO A (x) pu Le) 
Tiere pSnler Tes Formula (9.2) 
IT@f—floo, 10) WFO, (4) 


~ 


c 
(compare [28], p. 370; p. 574—580) 


0” [R(v/o) — 1] f (v) > —elo|” f (2) 


Saturation class of 


D(A)={fcexX; Af exists and €X} Jo (%) =U EX clv|” f(v) is the 
Fourier transform of a function g¢ X}. 
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We have restricted the above discussion to the reflexive Banach space Ly, 
1<p<2, since Lemma 9.1 b) is thus restricted. For a version of Lemma 9.1 in 
the case P=1 we refer again to [30]. 


Of great importance in the Fourier-transform method is the condition (4.1). 
In a forthcoming paper [13], we intend to generalize this condition by expanding 


k (vjo) in a Taylor polynomical about v=0. This would give results corresponding 
the to those in [15] for L,-space. 
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An Application of Fourier-Stieltjes Transforms 
in Approximation Theory 


PAUL L. BUTZER & HEINZ KONIG 


Communicated by C. MULLER 


1. Introduction 


It is the aim of this paper to continue the investigations on the degree of 
approximation of functions by general singular integrals started by one of the 
authors [3, 4]. The basic idea is to treat these problems by the so-called Fourier- 
transform method. In the present paper we give a very short proof of [4] 
Theorem 4.2 (Theorem 1 below) using a representation theorem of BOCHNER [2] 
which also leads to another proof of [4] Theorem 5.1 (Theorem 2 below). A 
generalization of BocHNER’s theorem due to R. S. PHILLIPs [6] is then applied 
to obtain a certain converse to the second theorem (Theorem 4 below). The 
conditions to be imposed upon the kernel will be investigated in [4]. 

A function P defined on —co<v<o is said to be of class % if it is the 


Fourier transform a PIES Oconee 

PO) =f) SV are) yf ian 
of a function f€L,(—, o). Also P is of class § © if it is the Fourier-Stieltjes 
transform 


Plo) = (0) = (1N2m) fe-** dg (x) 


of a function g of bounded variation on —w<x<oo. For PES we introduce 
the norm 


(1.4) || Pll = Well = (1/22) [Var g]°.0. 
If PC, we have 
(1.2) Pll [fll = (22) ffl 4 = [Iflh- 
If PES, it follows immediately that 
j=1 i ied 
for every set of complex c,,..., c, and real v,,..., v,.. The converse to the latter 


fact is given by the following theorems. 
Bochner Theorem. If P is continuous on —co<v<co and if there exists an 
M>0 such that 


(1.4) > ne @,) D een 
j=1 j=1 


for all complex cy, ..., c, and real v,,...,v,,7=1, 2, ..., then PES and ||P|| <M. 


<M 


oe) 
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Phillips Theorem. If P is measurable on —co<v<oo and if there exists an 


M>0 such that (1.4) is satisfied, then P is almost everywhere equal to a function 
QE S and ||Q|| <M. 


2. The Singular Integral 
We consider the singular integral 


Jy (x) = (o/V22) f f(x —u) k(ou) du 
(2.1) far 


== aya (x — =) k du 0) 
(12x) f (x4) Rw au  (e>0) 
for f€L,(— oo, o). The kernel 2 is an arbitrary function in L,(— oo, o) with 
(1//22) f k(u)du=1. 


Clearly J,() exists for almost all x, and J,E€L,(—o, o) with || J,|h<IIfll, ||. 
Also eae lilltaeeO for 9@—>oo. 

Assume that y>0. We denote by %(y) the class of functions /€L,(—o, ov) 
for which 


IlJo—Flh=O(@%) — (Q> &). 
For y>0 we introduce the function 


1—h (v) 
Hy(v) =) |o|” 
) (v =0) 


Ke), 0 == 0. 
eed pee 
c (v =0) 


The convolution theorem for Fourier transforms applied to (2.1) gives 


i.e) =f(v) R(vJo) — (e > 0). 


Thus the function ‘ ¥ : 
0” f(v) (1 — k(v/e)) =| 2|" /(e) 2, (/9) 
is the Fourier transform of @” (f— J,) so that by (1.2) 
Ilo Fe) Hy o/e)ll = elle = fll 


According to (1.3) we have from the BOCHNER theorem: The function f€ L, (— ©, o¢) 
belongs to %(y) if and only if there is an M>0 such that 


t - 
VU; % 
Ge 4 
j=l 


(2.2) Seley) Fe) H8(,J0)| <M 


co 


forall ¢, -.4,¢, and mh 4419 2, andrevery: o=0: 
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3. The Direct and Inverse Theorems 


The Bochner-Phillips theorem provides us with a unified method of proof 
of the subsequent results. However this method seems to be restricted to L,- 


space. 

Theorem 1. Assume that H?(v)—>c=+0 for v0. Then fEU(y) wmplies that 
[uF ESS. 

Proof. We pass to the limit @->co in (2.2) and apply the Bochner theorem 
to the function P(v )=0|"F(v) 

Theorem 2. Assume that i, exists and is in & S. Then every fEL,(—~%, ) 
such that |v|’f(v) € & S belongs to U(y). 

Proof. The application of (1.3) to the function |v |” f(2) gives 


> Lo Fo) He opie clo)’ Fe,) ,(u10)|, 
(1) ; 


eH, (v0) | 


a [oe 


where M,=|||v lf (v )||. A second application of (1.3) to the function H, (v/@) yields 


Y 
2 eat %t p—t4jx/0 
Sti 


foe) 


Sige H, (0 vilo)| < <M, 
(3.2) ae 


Z 15% 
I 


7—1 foo) 


for every t. Here M,=||H,,(v)|| since for H,,(v)=g(v) we have 


Ile (w/e) || = (1/22) [Varg (ox) ]*o0 
= (1/2) [Var g(x) ]®0 = [12 (2). 


Combining the inequalities (3.1) and (3.2), we obtain (2.2) with M=M,M,, 
establishing the theorem. 
The purpose of the following considerations is to obtain a converse to Theorem 2. 


Theorem 3. Let Uy) contain a function f € Ly(— ov, 00) such that | v | (2) —+>R=0 
for |v|>co. Then A, exists and is in § &. 


Proof. Substituting v; by v,@ in (2.2), we have 


tux 


Dalyel? fee) #9(%)| < 


Now we let goo. Since H?(0)=0 


, 


r 


») c; et yx 


j=1 


M 


= 
[RI 


(vj) 


and the desired result follows from the theorem of PHILLIPS. 


Cc 
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Lemma. We have (1+ |v|”)*€% for every y>0. 

Proof. Inthe case when 0<yS1, the function under consideration is monotone 
decreasing and convex in v=0 and thus, according to TircHMARSH [7, Theorem 
124, p.170], belongs to §. For y>4, the function is in L,(— oo, oo) and is 
furthermore absolutely continuous with derivative in L,(— oo, oo). A theorem 
of BEURLING [1, p. 349] gives the result. 


Theorem 3 combined with the lemma gives the desired converse. 


Theorem 4. Suppose that every fEL,(—oo, ) with lul"f(v) ESS belongs to 
U(y). Then H, exists and is in §G. 

Proof. The function f€L,(— oo, oo) with Fourier transform /(v) = (4+ | v|”)+ 
belongs to L(y) since |v |” f(v) =1—f (v) isin §S. But |v |” f(v) 4 for |v| oo, 
so that the result follows from Theorem 3. 

We conclude by remarking that for many kernels only the case 0<y<2 is 
of interest. 

Theorem 5. Let k(u) be even and non-negative. Then U(y) contains only the 
null function im case y> 2. 

Proof. From the equation 


co 
A 


H (x _ 1—hk (2) a, heal ie Vel COS A gy) 
ara icca pre reek oe 


it follows that H})(«) occ for x->0, since y>2. Now for any function /E€(y), 
applying (2.2) for y=1, we have 


lo) [#(@)| |H3 (e/e)| SM 


for o@>O0 and all v. Letting @—ov, it follows that }(v)=0 for v=+=0. The uni- 
queness theorem shows that / is the null function. 
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Hine Anwendung des Nirenbergschen Maximumprinzips 
fiir parabolische Differentialgleichungen 
in der Grenzschichttheorie 


W. VELTE 


Vorgelegt von H. GORTLER 


Von K. Nicxet [6] wurde gezeigt, daB man auch ohne explizite Integration 
der Prandtlschen Grenzschichtgleichungen bereits eine ganze Anzahl allgemeiner 
Aussagen tiber das mégliche Verhalten von ebenen Grenzschichtstromungen er- 
halten kann, z.B. iiber Entstehen und Anwachsen von Ubergeschwindigkeiten, 
iiber die Maxima und Minima der Schubspannung, und darauf aufbauend auch 
tiber Extrema und Wendepunkt der Grenzschichtprofile. 

Diese Ergebnisse hat NICKEL mit Hilfe eines Abschatzungssatzes von NAGUMO- 
WEsTPHAL [5], [9] gewonnen, der auch schon von H. GORTLER [/] in der Grenz- 
schichttheorie verwendet wurde. Um diesen Satz anwenden zu kénnen, muBte 
NICKEL zuvor die Prandtlschen Grenzschichtgleichungen fiir die zwei Kompo- 
nenten der Geschwindigkeit auf eine einzige partielle Differentialgleichung von 
nur einer gesuchten Funktion transformieren, namlich auf die v. Misessche bzw. 
die Croccosche Differentialgleichung. Eine solche Transformation ist aber nur 
fiir ebene Str6mungen moglich. Daran hegt es, daB NICKEL seine Ergebnisse 
nur im 2-dimensionalen Fall erhalten konnte. 

Im folgenden soll gezeigt werden, daB es noch einen anderen und zudem 
sehr einfachen Zugang zu den grundlegenden Nickelschen Satzen gibt, bei dem 
eine Transformation der Prandtlschen Differentialgleichungen nicht erforderlich 
ist. Die eigentliche Bedeutung der neuen Methode besteht aber darin, da8 sich 
in genau der gleichen Weise auch viel allgemeinere Falle behandeln lassen, 
namlich die 3-dimensionalen Grenzschichtgleichungen, und zwar stationar wie 
instationar. AuBerdem kénnen an der Wand weitgehend willkiirliche Werte fiir 
die Komponenten der Geschwindigkeit vorgegeben werden, d.h. es darf in sehr 
allgemeiner Weise abgesaugt bzw. ausgeblasen werden. Dagegen wird die ein- 
schrankende Voraussetzung einer rotationsfreien AuBenstromung beibehalten. 
Es lassen sich dann fiir die 3-dimensionalen Strémungen Aussagen gewinnen 
iiber das Anwachsen der beiden Geschwindigkeitskomponenten in Haupt- 
stromungsrichtung, in denen die entsprechenden Nickelschen Satze als Sonder- 
falle enthalten sind. Dabei ergeben sich sogar noch einige Verscharfungen. 

Aus dem Institut fiir Angewandte Mathematik und Mechanik der DVL an der 


Universitat Freiburg i. Br. Diese Untersuchung wurde geférdert vom Wirtschafts- 
ministerium des Landes Baden-Wiirttemberg. 
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Als mathematisches Hilfsmittel tritt jetzt an die Stelle des Nagumo-Westphal- 
schen Satzes fiir nichtlineare parabolische Differentialgleichungen das Maximum- 
prinzip von NIRENBERG [7] fiir lineare parabolische Differentialoperatoren 
2. Ordnung, d.h. das Analogon des bekannten Maximumprinzips von E. Horr [2] 
fiir elliptische Differentialoperatoren. 

Wir beschraénken uns in der vorliegenden Arbeit darauf, die inkompressiblen 
Grenzschichtgleichungen zu diskutieren. Es sei jedoch vermerkt, daB die Methode 
weiter tragt und sogar die kompressiblen wie auch die Grenzschichtgleichungen 
in krummlinigen Koordinaten in ahnlicher Weise zu behandeln gestattet, wortiber 
in Kiirze berichtet wird. 


1. Maximumprinzip bei linearen parabolischen Operatoren 
Im folgenden bezeichnet G stets eine offene, nicht notwendig beschrankte, 


zusammenhangende Punktmenge im Raum der Variablen x,, x,,..., x, und 
ee ae temesciclben) aol eKUre 4 ——(X_, Xo, epee ING (4% |2 = Dez 
(t=1,2,...,m). Analog sind ¢ bzw. |¢|? zu lesen. 


Es werden lineare Differentialoperatoren der Form 


(4A) = = 244; (%,2) 04; 04; 
(1 fiat a eek — 102.5, 77) 


+ >) 9; (x, 8) so + Dol et) 


betrachtet. Die Koeffizienten seien in G als stetige Funktionen erklart, und die 
quadratische Form 


(1.2) Q =>) 4,;(x, 1) pA 1) 


sei in jedem Punkt (x, 2) aus G positiv definit. 7 

Alle Funktionen, auf die im folgenden der 
Operator (1.1) angewandt wird, werden in G stetig 
vorausgesetzt, desgleichen ihre ersten Ableitun- 
gen nach x und ¢ sowie die zweiten Ableitungen 
nach x. 

Zur Formulierung des Maximumprinzips tiber- 
nehmen wir aus [7] folgende Definition: Es sei 
ein Punkt aus G. Man betrachte die -dimen- 
sionale Hyperebene ¢,= const (k=1, 2, ..., m) 
durch den Punkt B und bilde ihren Durch- 
schnitt mit G. Dann bezeichnet C(P,) die zusammenhangende Komponente des 
Durchschnittes, die R, enthalt. (Vgl. Abb. 1.) 

Satz 1.1 (NIRENBERG). Die Funktion (x,t) geniige in ganz G der Ungleichung 
Lif) =0 (SO). Wenn dann f(x, t) im Punkte CG thr Maximum (Minimum) 
anninumt, dann ist f(P)=} (BR) fiir alle PEC (f#). 

Dieser Satz laBt sich auf gewisse Randpunkte von G ausdehnen. Bei NIREN- 


BERG wird es in der Form nicht ausgesprochen, ergibt sich aber aus seinen Be- 
weisen mit einer geringen Modifikation. Diese Erganzung fiir Randpunkte ist 


Abb. 1. Zur Definition von C (Fj) 


28* 
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in Satz 1.2 formuliert. Im Anhang wird noch angegeben, wie man den Nirenberg- 
schen Beweis abzuandern hat ?. 

Es werde ein Gebiet G betrachtet, das ein zusammenhangendes Randstiick R’ 
besitzt mit folgenden Eigenschaften (vgl. Abb. 2): 

(a) Alle Punkte von R’ liegen auf mindestens einer der m Hyperebenen 
i= conse 4, 2). 25): 

(b) Zu jedem Punkt RE R’ mit den Koordinaten (x9, %) gibt es ein 7 >0, 
so daB die Menge H 


H(B): |x — ml? +|¢— te 


“ —< (ik by S tro 


ganz zu G+ R’ gehort. 

(Im Falle m=1 ist H eine Halbkugel, im allgemeinen Fall Segment einer 
Hyperkugel.) Die Stetigkeit und Differenzierbarkeit der Funktionen f(x, t) baw. 
der Koeffizienten von (1.1) wird fiir Satz 1.2 auf 
der Punktmenge G+ R’ vorausgesetzt. Die quadra- 
tische Form (1.2) sei auch auf R’ positiv definit. 
H(#,) | Vehnt man die Definition von C (f/f) auf die Menge 
4 G+ R’ aus, dann lautet die Erginzung zu Satz 1.1 

beziiglich des Randes R’: 


Satz 1.2. Die Funktion f(x, t) geniigein G+ R’ 

der Ungleichung L(f) 20 (S0). Ferner set auf R’ 

stets c,(x,t) <0 (k=1, 2,...,m).Wenn dann f (x,t) 1m 

i 7 Punkt hCR' thr Maximum (Minimum) annimmt, 

Abb. 2. Zur Definition des Randes R’ dann ist f(P)=}f(B) fiir alle Punkte PEC(B#). 


2. Stationdre Grenzschichtstromungen 
Im folgenden werden Lésungen der 3-dimensionalen stationéren Grenz- 


schichtgleichungen 


; 1 
a Pz tV Uyy 
2.4 
at) UW, + vw Os : me) 

ip mp) CO Panes ee ee Wyy 


(2:2) Wu ag iO 


betrachtet. Hierin bezeichnen x, y, z kartesische Koordinaten. Wie iiblich in 
der Grenzschichttheorie ist y die Koordinate senkrecht zur Wand. Ferner ist 
y die kinematische Zahigkeit (y>0) und o die als konstant betrachtete Dichte 
der Strémung. Die Druckverteilung  ergibt sich aus der rotationsfrei vorausge- 


setzten AuBenstrémung U= U(x, 2) >0, W=W(x,z) >0 mittels 1 p4 ty (U2 W2) 
0 2 


=const. Die Funktionen U, W und die Komponenten wu, v, w der Geschwindigkeit 
sollen so oft stetig differenzierbar sein, wie es in den Differentialgleichungen (251) 


* Fur elliptische Operatoren hat schon E. Hopr [3] Randmaxima untersucht (vergl. 
auch Miranpa [4}). Fiir elliptisch-parabolische Operatoren findet man entsprechende 
Satze z.B. bei C. Pucci [8]. Die in Satz 1.2 formulierte Erganzung im AnschluB an 
NIRENBERG [7] ist speziell auf unsere Anwendungen zugeschnitten. 
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und (2.2) vorkommt. AuBerdem soll die duBere Randbedingung 
(2.3) lim (wu — U) = lim (w — W) =0 
y—> oo 


erfiillt sein. . 

Neben den Komponenten wu, w selbst werden wir noch die skalare GréBe 
u?-+-w® — U?— W? betrachten. Sie stellt (bis auf einen konstanten Faktor) die 
Differenz der kinetischen Energie eines Volumenelementes gegeniiber der 
kinetischen Energie der AuSenstr6mung dar. Zu jeder Stelle (x, z) der Wand 
(vy=0) kann man sich dann diese GréBe in Abhangigkeit von y auftragen. An 
die Betrachtung dieser ,,Energieprofile‘‘ kniipft sich eine wichtige Maximum- 
eigenschaft, die eine a priori Abschatzung von w?+ w? gestattet. 

Es werde nun eine bestimmte Lésung wu, v, w der Gleichungen (2.1) und (2.2) 
ins Auge gefaBt. Mit diesen Funktionen erklaéren wir einen Operator L durch 


(2.4) L=v—-—v —ua~——w 


Hierin ist y die ,,elliptische“‘ Variable (n=1), und 
x, 2 sind die ,,parabolischen‘‘ Variablen (m= 2). 
Die quadratische Form (1.2) lautet hier einfach 
Q=v&, ist also positiv definit. Wir wenden diesen 
Operator der Reihe nach an auf die Funktionen 
u, w, u*+w* —U2— W?. Unter Benutzung von (2.1) 


Zp! 
und (2.2) erhalt man Abbia. Zar Detiniion vom dene 
(2.5) L(u)=—(UU,+ WW) bzw. L(w)=—(UU.+WW,), 

(2.6) L(u? + w? — U? — W?) = 29(u,)? + 2y(w,)? = 0. 


Hinzu kommt noch im ebenen Fall (w=W=0, U= U(x), alle Funktionen von z 
unabhiangig), wenn Zusatzlich Existenz und Stetigkeit von u,,,,, U,,=Uy, Voraus- 
gesetzt werden, 


(2:7) L(uy) =0. 


Die Anwendung des Maximumprinzips legt nun auf der Hand. An sich lassen 
sich Satze fiir sehr allgemeine Gebiete G formulieren. Fiir die Diskussion von 
Grenzschichten sind aber vor allem zylindrische Gebiete von Interesse, auf die 
wir uns hier beschranken wollen. 

Es sei zunachst G, ein offenes, zusammenhangendes, beschranktes Gebiet in 
der x, 2z-Ebene, das gema8 Abb. 3 von zwei Geradenstiicken mit *=xp bzw. 
z=2Zp begrenzt wird, so daB fiir alle Punkte (x, z) aus Gp stets r< xp, 2<Zp 
gilt. Die Bedeutung der Rander R, (abgeschlossen) und Ro (offen) geht aus 
Abb. 3 hervor. 

Mit G werde dann der Zylinder 0<y<o bezeichnet, dessen Punkte sich 
langs der y-Achse auf Gp projizieren. R’ bezeichnet die Randpunkte von G mit 
0<y<o, die sich auf Rg projizieren. R ist die Gesamtheit aller tibrigen Rand- 
punkte von G. Es ist unmittelbar klar, daB der Rand Rk’ die Eigenschaften (a) 
und (b) aus Abschnitt 1 besitzt. Es wird nun eine in G+R-+R’ stetige Lésung 
u,v, w der Gleichungen (2.1) und (2.2) betrachtet, deren Ableitungen, soweit sie 
in den Gleichungen auftreten, in G+ R’ stetig seien. 
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In den folgenden Satzen wird an der Wand (y=0) tiber die Komponente v 
in y-Richtung keine einschrinkende Voraussetzung gemacht. Dagegen sind die 
Komponenten u, w in Hauptstromungsrichtung dort nicht ganz willkiirlich. 

Satz 2.1. Es sei u=0, w =0in ganz G+R-+R’. Wenn auferdem UU,+WW,=2 0 
ist, dann gilt sogar u> 0 in ganzG-+ R’. (Das gleiche gilt fiir w, wenn UU,+-WW,= 0.) 

Beweis. Nach (2.5) ist L(w)<0, so daB man das Minimumprinzip in G+ R’ 
anwenden kann. Wiirde nun im Punkte R,€ G+R’ der Wert u=0 angenommen, 
dann hatte die Funktion «, von der durchweg u=O gilt, dort ihr Minimum 
erreicht. Nach Satz 1.1 bzw. Satz1.2 miBte dann auch w=O sein auf ganz 
C(P,), d.h. auf der Halbgeraden += %), 2=%, OS <0, im Widerspruch zur 
auBeren Randbedingung w—->U>0 fiir y> ow. 

Im folgenden werden wir die 4uBere Randbedingung in etwas strengerer Form 
voraussetzen, namlich mit 
(2.8) slimaeag — U) = lim (@ —W)=0 gleichmaBig in x und 2. 

Das heiBt nach Vorgabe eines positiven ¢ ist es méglich das y. so zu wahlen, 
daB fiir y=, und beliebiges (x, z) aus Gy+R,+Ro immer gilt: |w—U| Se, 
|w—W|<e?. Aus (2.8) ergibt sich dann auch 
(2.9) LCs — U*) ie — "WW 2) == 0) gleichmaBig in x und z. 
Ferner fiihren wir noch ein: 
(2.10) “#=supu, W=supw, M=sup (w+ w?— U2—W?). 

PER PER PER 
Es ist immer M=O wegen u?+ w? — U2— W?->0 fiir y>oo. Ist M>0, so hat 
man _ ,,Ubergeschwindigkeiten‘‘ auf R, d.h. Stellen, in denen w—U>0 oder 
i= We Oist 

Satz 2.2. (a) Es set u=0, w20 auf R’. Die duBere Randbedingung sei fiir 
wu im der strengeren Form (2.8) erfiillt. Ferner sei 0< USu, UU,4+WW,S0 in 
G+R’'. Dann ist uSu in ganz G+R’. 

(b) Wenn an der Wand zusdtzlich u<u erfiillt ist, dann gilt sogar u<u% in 
ganz G+R’. 

(Das gleiche gilt fiir w, wenn die duBere Randbedingung fiir w in der strengeren 
Form erfiillt ist, und wenn 0<W<w, UU, WW,<0 ist.) 

Kurz gesagt bedeutet der Satz folgendes: Wenn auf dem hinteren Rande R’ 
des durchstrémten Gebietes nirgends Riickstrémung vorliegt, und wenn die 
kinetische Energie der AuBenstrémung nirgends zunimmt, dann kann eine auf 
R vorhandene Ubergeschwindigkeit nicht stairker anwachsen als bis zu @ bzw. @. 


* Man hatte diese Forderung auch anders fassen k6nnen, wie es NICKEL im ebenen 
Fall getan hat: Nach der Transformation 


sollen die Funktionen w(x, y (7), z), v(“,¥(y),z) in dem endlichen Gebiet G mit 
(¥, 2) €Gy+Ryo+ Ry und 0X7 <1 stetig sein, und es soll gelten 


jim, u(x, y(n), 2) = U(x, 2) bzw. im, DAW), 2) = We 
PEG PEG 
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Liegt auf R nirgends eine Ubergeschwindigkeit vor, dann kann auch keine mehr 
entstehen *. 

Beweis. (a) Angenommen, es gabe Punkte mit w>4%, z.B. sei w=t%i+a (a>0) 
in PC€G+R’. Dann kann man wegen (2.8) ein y, so wahlen, daB fiir alle y=, 


immer u—U< = gilt, und daher wegen USX# auch immer uwXu@#+ a 


Jetzt wird der bei y=y, abgeschnittene Zylinder betrachtet. Es bezeichne 
G, (bzw. R‘,) die Gesamtheit der Punkte von G (bzw. von R’) mit 0<y<yp. 
R,, bezeichne die Gesamtheit der Randpunkte von G,, die nicht zu Rj, gehéren. 
G,+R,+R', ist eine beschrankte und abgeschlossene Punktmenge, auf der die 
stetige Funktion wu ihr Maximum annimmt. Und zwar ist Max(u)2%#+« und 
wird in einem Punkt BR, aus G+’ angenommen. Da nun nach (2.5) immer 
L(u) = 0 ist, kann man das Maximumprinzip anwenden: Das Maximum wird auf 
ganz C,() angenommen, d.h. auf dem offenen Geradenstiick x=%), z=2, 
O0<y<‘¥, und wegen der Stetigkeit von w auch in den Endpunkten, was aber 


im Widerspruch steht zu uSi+— fir y=y,. Es ist daher w<u in ganz G+ -R’. 


(b) Wiirde der Wert u=avin RE G+R’ angenommen, so hatte die Funktion w, 
von der ja w<% gilt, dort ihr Maximum erreicht. Dann ware wieder u=4% auf 
ganz C(R), d.h. auf x=%), z=%, O<y<oo mit EinschluB des Endpunktes 
y=0, im Widerspruch zur Voraussetzung u<u% an der Wand. 

Der nachste Satz bezieht sich auf die kinetische Energie und enthalt eine 
a priori Abschatzung fiir u?+ w?. Mit der Konstanten M aus (2.10) lautet er 

Satz 2.3. (a) Es ser u=0, w=0 auf dem Rand R’. Die duBere Randbedingung 
set fiir u und w in der strengeren Form (2.8) erfiillt. Dann ist 


wtwt<U?1+W2+M im G+R’. 


(b) Wenn die Ungleichung zusatzlich an der Wand mit dem Zeichen < erfiillt 

ist, dann gilt sogar 
wtwt<U?1+W2+M im G+R’. 

Beweis. (a) Wir schreiben f=u?+ w?— U?— W?. Angenommen, es gabe einen 
Punkt P€G+R’, in dem />M ist. Es sei etwa {(P)=M-+a(a>0). Nach 
(2.9) kann man dann ein y, so wahlen, daB fiir alle y>y, gleichmaBig in x, z 
stets |f| <M+ = gilt. Nun wird wieder das bei y, abgeschnittene Gebiet G, 
betrachtet. f nimmt auf G,+R,+R, das Maximum an: Max (/) 2M-+«. Auf 
dem Rande R,, ist aber fSM-+ a Also mu8 das Maximum in einem Punkt 
P,€G+R’ angenommen werden. Nach (2.6) ist aber L(f) 20, und daher nach 


dem Maximumprinzip /(P)=/(R) auf ganz C,(A), dh. auf x =%, z=2%p, 
0<y<v%» einschlieBlich der Endpunkte, was aber im Widerspruch steht zu 


[| SM+ > fiir y2y0. 


* Zusatz bei dev Korvektuy: Fiir den Sonderfall, daB u,v, w, U, W alle von z 
unabhangig sind, findet sich bei NICKEL (ZAMM 36, 303—304 (1956)) die dort nicht 
naher begriindete Bemerkung, da8 Ubergeschwindigkeiten nicht von selbst entstehen 
k6énnen, wenn die AuBenstr6émung drehungsfrei ist. 
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(b) Wiirde der Wert /=M im Punkte P€G+R’ angenommen, dann ware 
auch f=M auf ganz C(A), d.h. auf der Halbgeraden x= % , 2=%, OS Y<00 
einschlieBlich des Endpunktes y=0, was der Voraussetzung / <M an der Wand 
widerspricht. 

Anmerkungen. Beschrankt man sich auf ebene, stationare Grenzschicht- 
stromungen, dann reduziert sich das Gebiet G auf einen unendlichen Halb- 
streifen: 

Go 4 = 4 fa, oe Ve 
Rk’: 4%, 0) < ti co 
Ree Gag O0<y<oo und 


mSeSm, y= 


Wir wollen im folgenden den Fall (b) aus Satz 2.3 betrachten, d.h. es sei [<M 
an der Wand. Dann kann man auch schreiben 
M =sup (w2(m, y) — U(x). 
0Sy<oco 
Definiert man noch die GréBe e als maximale Ubergeschwindigkeit im Anfangs- 
profil durch 
e=sup (w(%,¥) — U(m)), 


0Ssy<oo 


dann besteht der Zusammenhang M=e?+2eU(x,). Man kann daher Teil (b) 
aus Satz 2.3 auch so formulieren: 


Satz. Es sei u=0 auf dem hinteren Rande R’. Die duBere Randbedingung 
fiir u set in der strengeren Form (2.8) erfiillt, und an der Wand set u? — U?<e?+ 
2eU(x,). Dann gilt 

u<Veri+2eU(x%,)+U2(x) in G+HR’. 


In dieser Form hat NicKEL die Abschatzung fiir die Ubergeschwindigkeiten 
angegeben, allerdings nur unter der Voraussetzung w=0 in G+-R+ R’ und u=0 
an der Wand, und zwar mit dem Zeichen < in der Ungleichung. 


Satz 2.1 lautet fiir den ebenen Fall: 
Satz. Es set u=o0, U,=0 in G4+R+R’. Dann ist sogar u>0 in G+R’. 


Auch hierin ist eine leichte Verscharfung enthalten gegeniiber NICKEL, der 
an der Wand u=0 und v<0 voraussetzte. 


Eine unmittelbare Folge des Maximumprinzips ist schlieBlich der Satz iiber 
die Schubspannung w, im ebenen Fall. Man kann ihn so aussprechen: 


Satz. (a) Es set G ein beliebiges offenes, nicht notwendig beschranktes Gebiet 
im der x, y-Ebene, in dem u,v als Lésung der 2-dimensionalen Grenzschichtgleichun- 
gen defimiert ist, und in dem auch U,,,, Uyy=Uyy Stetig existiert. Wenn Uy Im einem 
Punkt REG sein Maximum oder Minimum annimmt, dann ist uy konstant auf 
ganz C (fy). Die Aussage dehnt sich aus auf G+ R’, wenn u, v samt den Ableitungen 
auf G+R’ stetig ist und wenn zusitzlich u=0 auf R' ist. 

(b) Ist uw, auf dem beschrankten Gebiet G einschlieBlich des Randes R+R’ stetig, 
dann nimmt u, sowohl sein Maximum als auch sein Minimum auf R an. 
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Ist speziell G der schon oben betrachtete unendliche Halbstreifen, und nimmt 
u, in RE G+ R’ das Maximum oder Minimum an, dann ist uy konstant auf der 
Halbgeraden x= x), OX y<oo, d.h. das Profil an der Stelle x) hat die Gestalt 
U(%, y)=ay-+ B mit «=0, B=U(x) wegen der 4uBeren Randbedingung. Ist 
aber insbesondere an der Wand u(x, 0)< U(x) vorgeschrieben, dann kann das 
Maximum oder Minimum iiberhaupt nicht in G+ R’ angenommen werden. 


3. Instationare Grenzschichten 
Wir betrachten die Grenzschichtgleichungen 


1 
U,+ UU, + UU, 4 eS eed See 1? 


1 
W+uUw, +Vw, be Eo dae 


U,+v,+W,=0. 


Die Au8enstrémung wird zeitunabhdangig, d.h.in der Form U=U(x,z), W=W(x, 2) 
vorausgesetzt und auBerdem rotationsfrei?. Dann gilt fiir den Druck die Be- 


ziehung : p- : (U?-++ W?) = const, und man findet mit dem Operator 


2 
ez é é 0 é 
L= u ms = 
i oy? Ox oy 2 ot 


fiir Losungen u, v, w der Gleichungen ganz analog wie friither 
Lu) =—(UU,+WW,), L(w)=—(UU,+ WW) 
LA a —— Ue W2) —— 29 (,)2 1 Dy (wy) t= Oe 


Die Satze 2.1—3 iibertragen sich daher ohne weiteres auf instationare Grenz- 


schichtstromungen. 
Der Anschaulichkeit halber wollen wir die Verhaltnisse am Beispiel der ebenen 


Str6émungen naher untersuchen. Die Gleichungen lauten dann 


(3.1) Uw+uu,+uvu,=UU'+yu4,,, 
(3.2) U, + Vy =0 
und der Operator reduziert sich auf 
sy ae é Oiiiane 
carte) Gye uMan iis ome ae, 


Hierin ist wieder y die ,,elliptische‘‘ Variable (1=1) und x, 7? sind die ,,para- 
bolischen‘‘ Variablen (m= 2). Fiir Lésungen wu, v der Gleichungen (3.1) und (3.2) 
erhalt man dann 

Liu =— UU", Liu? — U*) =2r(u,)?20, L(u,)=0. 


Es liegt also formal das gleiche Problem vor wie in Abschnitt 2. Man hat ledig- 
lich die drei Raumkoordinaten umzudeuten in zwei Raumkoordinaten und eine 
Zeitkoordinate. Das Gebiet G und seine Rander R und Rk’ werden wie folgt 


8 Zeitabhingige AuRBenstr6mungen U(x, z,t), W(x, 2,t) bleiben einer spateren 
Untersuchung vorbehalten. 
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erklart: Gog a ee he OR ee a ee 
Uf A SSI OV ato - tt aie ST 
pay. OS VO. Ny = Waite 


R: Die tibrigen Randpunkte von G. 


Wir betrachten wieder eine Lésung #, v der Gleichungen, die in G+ R-+ R’ stetig 
ist, und deren Ableitungen soweit sie dort auftreten in G+ R’ stetig existieren. 


Satz 3.1. Es sei u(x, y,t)20inG+R-+R’, U(x) >0 und U'S0m SKS. 
Dann ist sogar u(x, y,t) >On G+ R’. 

Das heiBt zu jeder Zeit ¢ (4-<¢Xt,) wird der Wert w=O héchstens an der 
Wand (y=0) oder im Einlaufprofil w(%,, y, ¢) angenommen. 

Fiir Satz 3.2 und Satz 3.3 wird die auBere Randbedingung wieder in der 
strengeren Form vorausgesetzt: 


(3.3) lim (4 — U) =0 gleichmafig in x und ft. 
y— oo 
AuBerdem wird gesetzt 
(3.4) a=—supu,  M=sup?— U2). 
Gels PER 


Es wird also die obere Grenze in folgenden Punkten gebildet: An der Wand 
(y=0) zu allen Zeiten ¢, dann in den Einlaufprofilen u(x,, y, ¢) zu allen Zeiten f, 
ferner zur Zeit ¢=¢, in allen Punkten x, y, d.h. fiir u(x, y,4,). Bei u*— U? ist 
es analog. 

Satz 3.2. (a) Es ser u=0 auf R’. Die duBere Randbedingung set in der stren- 
geren Form (3.3) erfullt. Ferner seo U>0, USO m %S4% Sx. Dann rst 
u(x,y,t) Su in G+R’, 

(b) Wenn zusdatzlich an der Wand u<u ist, dann gilt sogar u(x, y,t)<u im 
G+ R’. 

Wenn also keine Riickstrémung vorliegt und U nirgends anwachst (bzw. 
kein Druckabfall stattfindet), dann kann uw im ganzen Zeitintervall 4<t<t, 
nicht groBer werden als die Konstante #. 

Satz 3.3. (a) Es sei u=0 auf R'. Die duBere Randbedingung sei in der stren- 
geren Form (3.3) erfillt. Dann ist ua<U?+M in G+R’. 

(b) Wenn diese Ungleichung zusatzlich an der Wand mit dem Zeichen < erfiillt 
ist, dann gilt sogar u2< U?1+M inG+R’. 

Die a priort Abschatzung bleibt also auch fiir zeitabhangige Geschwindigkeits- 
komponenten w(x, y, ) in derselben Weise bestehen. Tragt man sich die ,,Energie- 
profile u? — U? fiir festes ¢ und festes « in Abhangigkeit von y auf, so besagt 
der Satz, daB eventuelle Maxima nicht gréBer sein kénnen als die Konstante M. 

Unmittelbar klar ist im AnschluB an L(w,)=0 auch das Maximum-Minimum- 
prinzip fiir die Schubspannung w,(x, y, t). Wir begniigen uns damit, es fiir fol- 


gendes Gebiet zu formulieren: 
GB AIH i i aie AV yee mail, 
RS th ae Vir Wash het Sa) nd 
t=ty, MoV Ve, MX SX 
R: Die iibrigen Randpunkte von G. 
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Satz 3.4. (a) Es sei u(x, y,t), v(x, v,t) als Lésung der 2-dimensionalen, in- 
stationdren Grenzschichtgleichungen im Gebiet G definiert, und es sei dort auch 
Uy yy, Ugy—=Uyx, Upy=Uy, Stetig. Wenn u, in einem Punkt PREG sein Maximum 
oder Minimum annimmt, dann ist u, konstant auf ganz C(B,). Die Aussage dehnt 
sich aus auf G+R’, wenn u,v samt den Ableitungen auf G+ R’ stetig ist und 
wenn zusatzlich u=0 auf R’ ist. 

(b) Ist u, auf dem beschrankten Gebiet G einschlieBlich des Randes R-+ R' stetig, 
dann nimmt u, sowohl sein Maximum als auch sein Minimum auf R an. 


Fiir den unendlichen Halbstreifen mit y,=0, y,=co folgt wieder, daB bei 
Vorliegen eines Maximums oder Minimums das Profil von der Gestalt u(x, y, to) = 
ay+B ist mit «=0, B=U(x)) wegen der 4uBeren Randbedingung, und daB 
insbesondere ein Maximum oder Minimum iiberhaupt nicht angenommen werden 
kann in G+-k’, wenn an der Wand w(x, 0, t)< U(x) vorgeschrieben ist. 


Anhang. Zum Bewets von Satz 1.2. Um die Giiltigkeit des Satzes fiir Punkte 
des Randes R’ einzusehen, braucht man nur den Nirenbergschen Gedankengang 
zu verfolgen und an wenigen Stellen zu modifizieren. Wir beschranken uns darauf, 
dies im Falle n=1, m=1 auszufiihren. Fiir beliebiges m und m verlauft es ganz 
analog. 

Es sei also G ein offenes Gebiet in der x, Ebene mit einem Randstiick R’, 
das den Bedingungen (a) und (b) aus Abschnitt I geniigt. Der Operator lautet 


——: Oo 0 | c S = 4 a , 
Ik eee ae: alee mit @->0i0,G + ¢=0 aul Rk’. 


Ahnlich wie bei NIRENBERG wird zunachst ein Lemma hergeleitet. Hierzu sei 
P, ein Punkt auf R’. Es gibt dann immer ein 7)>0, so daB der Halbkreis Hy 
und der Bogen By 

Hee Ve ey Pee P<, tS ty 

Boalt tol” oe tee tol on aklice ty 
ganz zu G+ R’ gehoren. 

Lemma. Wenn L(f)=0 ist in ganz G+’, und wenn die Funktion f ihr 
Maximum MM nirgends in Hy wohl aber auf By im Punkt PB, annimmt, dann ist 
notwendig += %p. 

Beweis. Man denke sich neue Koordinaten eingefiihrt mit R, als Koordinaten- 
ursprung. Dann ist zu zeigen, daB x,=0 sein mu8. Der Beweis wird indirekt 
gefiihrt. Es werde also x,=+-0 angenommen. Zunachst ist klar, daB A kein Punkt 
aus G sein kann, weil dann nach Satz 1.1 das Maximum auf ganz C (4) angenom- 
men wird, d.h. einem Geradenstiick, das Punkte mit H, gemeinsam hat. Es 
bleiben also nur die Endpunkte des Bogens By zu diskutieren. Es sei etwa 
B=(r, 0). Da BER’, kann man ein r>0 so finden, daB H und B mit 


Hy |x—47 (2+ <7, iSO 
B |4— 7? -2=77, 150 


ganz zu G+R’ gehért. Sorgt man insbesondere fiir 0<7<7), dann gehort der 
Punkt (0, 0) nicht zu H+ B. Der Bogen B zerfallt in zwei Teilbégen: B’ (abge- 
schlossen) liegt in H,+ By, und B” liegt auBerhalb davon. Auf B’ liegt kein 
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Maximumpunkt, d.h. dort ist <M oder SM —y mit einem geniigend klein 
gewahlten positiven 7. Da tiberall {<M ist, hat man speziell 
/SM—y aut B’, i= I want 3b. 


Es wird nun die schon von E. Horr benutzte Funktion 


h(x, t) = exp {— a(x2+#)}—exp{—a7}  (a>0) 


betrachtet. Sie ist positiv in Hy, null auf B, und negativ auBerhalb Hy+ By 
fiir <0. Man findet 


exp {a (x? + 2)} L(h) = 402 a(x, t) x* — 2a {a(x,t) +.b(x,t) % + ¢(x, t) th. 


Da (0, 0) nicht zu H+B gehort, ist dort immer 4a2a(x, ¢)x*>0. Dabei kann 
man « so groB wahlen, daB der in « quadratische Term die in « linearen tiberwiegt, 
so daB L(h)>0 ist in ganz H+ B. Wahlt man nun ein positives « so klein, dab 
mit obigem 7 stets eh(x,t) <4 in H+ B ist, dann hat die Funktion v=/+ eh 
folgende Eigenschaften: 

Es ist v<M auf B’, weil dort {<M —y ist, und es ist v<M auf B”, weil 
dort h negativ ist. Beachtet man L(u)=0, L(h)>0, so ergibt sich 


Liv) >0 auf H+B, v<M auf B. 


Andererseits ist 4(B)=0, f(R)=M und daher v(R)=M. Die Funktion v 
nimmt daher auf B+ HA ihr Maximum nicht auf dem Bogen B sondern nur in 
einem Punkt P€ H an. In P ist dann v,, <0, v,=0, und falls PEG, auch v,=0. 
Wenn dagegen P auf dem Rande R’ liegt, so ist wenigstens v, 20. In jedem Falle 
ist aber wegen c(x,?)<O0 auf R’ insgesamt L(v) <0 in PCH, im Widerspruch 
za Ly) >O-ant H-- 6, 

Es ist klar, daB das Lemma auch fiir Halbellipsen anstelle von Halbkreisen 
gilt, wenn die Hauptachsen parallel zur x- bzw. ¢-Achse sind. Man hat auch 
wieder nur die Endpunkte des Bogens zu betrachten, die auf R’ liegen. Es 
genigt dann, Halbkreise zu nehmen, die diesen Ellipsenbogen in den Endpunkten 
von innen beriihren. 

Zum Beweis des Satzes selbst werde nun angenommen, da8 die Funktion / 
im Punkt P€R’ ihr Maximum M annimmt. Ware nun Q ein Punkt von C(P), 
in dem {<M ist, so gabe es auf der geradlinigen Verbindung ven Q und P, die 
ganz in C(P) verlauft, einen ersten Punkt B mit /(R)=M, wahrend zwischen 
Q und P immer /<M ist. Es seien x,, ¢, die Koordinaten von BR. Es werden 
dann nur noch Punkte betrachtet, die der Menge H,+ B, angehoren: 


Ay: |x—«|?+\|¢—-“4/2?<7i, 154 
By: |x |? | tt ==S74; t=4; 


wobei 7,>0 so klein gewahlt ist, daB H,+ B, ganz zu G+R’ gehort. 


Es sei nun FJ ein Punkt auf der Verbindungsgeraden zwischen Q und BR 
mit |x%)—«,|<$7,. In P ist f<M, desgleichen ist {<M auf einem Segment A 
parallel zur t-Achse: x= %9, t) —aStSt, (x>0). Man betrachtet dann die Halb- 
ellipse (¢S%) mit A als der einen Halbachse und mit dem Durchmesser B in 
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x-Richtung: t=‘), OS|%) — x|<f. Fir B=0 reduziert sich die Halbellipse auf das 
Geradenstiick A, und dort ist f<M. Fir B=|x,—-x,| liegt auf dem Rande 
der Punkt R mit /=M. Beim stetigen Ubergang von B=0 zu B=|%)—%4| 
gibt es dann in der Schar der Halbellipsen eine erste mit folgender Eigenschaft: 
Auf dem Bogen liegt ein Maximumpunkt, im Innern keiner. Das ist aber gerade 
die Situation des Lemmas. Der Maximumpunkt hat dann notwendig die Koordi- 
nate *=%,, d.h. er liegt auf der Halbachse A, im Widerspruch zu f< M auf A. 


[1] 


[9] 
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Electric Current and Fluid Spin Created 
by the Passage of a Magnetosonic Wave 


R. P. KANWAL & C. TRUESDELL 


Since the discovery of magneto-hydrodynamic waves by ALFVEN?, much has 
been written about them, but one point has failed of general remark”: A magneto- 
hydrodynamic wave causes the fluid particles to spin about axes parallel to the 
electric current density it carries; the spin is clockwise or counter-clockwise with 
respect to the current flow according as the wave normal subtends an acute or an 
obtuse angle upon the magnetic field vector. This fact may be verified by inspection 
of the known special solutions. Here we present a general theory of magneto- 
hydrodynamic waves, developed in such a way as to draw attention to this 
new means of generating vorticity in fluid motions. 


Weak waves may be studied from five approaches: (a) sinusoidal or other 
special exact solutions, (b) a linearized theory of small motions, (c) the exact 
theory of lineal motions, (d) the theory of characteristics, (e) the theory of singular 
surfaces. All but the last of these have been followed in magneto-hydrodynamics3; 
in particular, a brief but exhaustive treatment by the method of characteristics 
has been included in FrreEpRIcHs & KRANZER’S excellent study of waves of all 
kinds*. We shall not expect to find results not contained, at least by implication, 
in theirs. In developing the subject here by means of the theory of singular 
surfaces we wish not only to remind physicists of the elegant simplicity of proof 
offered by the method of CHRISTOFFEL and HuGonrot, but also to throw light 
upon the results themselves through expressions which have, at every stage, 
immediate physical interpretation. 


1 [1942]. 

2 A relation of this kind, for a linearized theory, has been published by Carsro1u 
[1960]. 

* (a) ALFVEN [1942] [1950, §§ 4.3—4.51] WaLEN [1944, §§ 3—4]. In these special 
solutions the vorticity and electric current density are everywhere parallel. (b) HEr- 
LOFSON [1950, 3], VAN DE Hutsr [1950, 5], Carstoru [1960]. (c) SEGRE |1958, 2]. 
A general survey of approaches (a), (b), and (c) is given by Lust [1959, 3]. (Ref- 
erences on shock waves are not given; we remark, however, that one of us has deter- 
mined the relation between current and vorticity generated by a shock [1960, 2]). 

* (1958, 1, §§2—5]. Cy. also One [1950, 2]. The summary of Russian work 


given [1959, 1] is too concise to be assessed; we have not been able to see all the 
references cited there. Cf. also [1957, 2]. 
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ALFVEN discovered transverse waves which propagate in incompressible in- 
viscid fluids of negligible electric resistance at speed a, where® 


2_ MH; 

= A (1) 
H,, being the component of the magnetic field H normal to the wave-front®, 
jz the permeability of the fluid, and @ the mass density. In a compressible non- 
magnetic fluid for which £=/(@, 7), where # is the pressure and 7 is the specific 
entropy, if viscosity and heat conductivity are negligible the only possible waves 
are longitudinal waves travelling at the speed c, where 

2__ (oP 

Aa ( é al J (2) 
In fluids which are both compressible and magnetic, it is natural to expect that 
waves will generally be neither transverse nor longitudinal, and that their speeds 
of propagation will differ from a and c. 

First we recall the known results obtained by the method of singular surfaces. 
We do not restate the conditions defining the fluid. Writing w for the velocity 
field, we restrict “‘singular surface’ to mean a surface across which wu, 0, , and 
H are continuous, but some discontinuity is suffered by at least one of the partial 
derivatives of these quantities with respect to the spatial co-ordinates? or the 
time. In particular, shock waves are not considered in this paper. We adopt 
the hypotheses of regularity usual in the theory of singular surfaces®. 

Hugoniot-Hadamard theorem: In a compressible non-magnetic fluid, only the 
following kinds of singular surfaces are possible: 

1. Material vortex-sheets. 

2. Longitudinal waves propagating at speed --c in any direction. 

If we let [J] stand for the jump of F at the singular surface, while m is the 
unit normal pointing in the sense in which jumps are calculated, then these 
sonic waves carry arbitrary jumps of the expansion or the pressure gradient, 
but the two are related by the equation of state. That is, if 


, wee), (3) 


then w=gec?, but either @ or e may be arbitrary. ‘Material’ means “‘bound 
to the fluid”. A ‘‘vortex-sheet”’ is a surface across which the vorticity or spin 


5 M.K.S. units are used throughout this paper. 

6 In much work H appears instead of H,, in the definition of ALFVEN’s speed, 
but, as appears from the system (4), (5) below, (1) is exact, and other forms are only 
approximate. The matter is obscured in WaLkn’s solutions [1944, §§ 3—4], [1950, 
§ 4.51] but may be cleared by noting that as formulated, they are not invariant 
under addition of an arbitrary constant vector to the disturbance field, and in fact 
only disturbances not normal to the constant field can be propagated. 

7 “Singular surface’ = ‘‘wave of first order’ in the terminology of LicHTEN- 
STEIN; in that of HaDAMARD, used in [1958, 3], ‘‘singular surface’ = “‘wave of second 
order’, except that material singularities affecting only second derivatives with 
respect to the material co-ordinates are omitted from our treatment. 


8 E.g., [1958, 3]. 
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of the fluid particles is discontinuous but the density gradient is continuous. 
The discontinuity carried by a vortex-sheet is transversal; the first statement 
in the Hugoniot-Hadamard theorem asserts that transversal discontinuities can- 
not propagate but must eternally divide one portion of fluid from another. 

Corresponding theorem for ALFVEN’s waves: In an incompressible magnetic 
fluid, only the following kinds of singular surfaces are possible : 

4. Material vortex-current-sheets. These must be everywhere tangent to the lines 
of induction (lines of magnetic force); the jumps of vorticity and current density 
are tangent to the surface but otherwise arbitrary. 

2. Transverse waves carrying jumps of current density and vorticity satisfying 
the relations 


Q U[w) =— 4H, [5], (4) 
H,,[w]=— Uj], (5) 
wo ="[j)-H_. (6) 


Here U is the spead of propagation of the wave, and w and j are the vorticity 
and the electric current density: 


w=culu, j=curlH; (7) 


the jumps [ew] and [j] must be tangent to the wave-front, and H is split into 
normal and tangential components: 


H=H,n+H, =(H-n)n—nx., H=nxH., (8) 


Thus H, is the vector obtained by rotating the tangential component H- of H 
through an angle of 90° counter-clockwise about the normal n. 

From (4) and (5) we see at once that if H,,=0, 7.e., if the wave-front is tangent 
to H, then U=0O, and we fall back upon Case 1. This kind of singular surface 
again divides permanently two portions of fluid, but it must be a surface swept 
out by lines of induction. The jumps of w and j across it are arbitrary, so long 
as they be tangential. If H,,= 0, it follows at once from (4) and (5) that U?=a?. 
Any wave-front not tangent to the magnetic field is propagated at ALFVEN’S speed 
+a. Note that a’, unlike c?, depends upon the orientation of the wave front 
relative to the magnetic field, being greatest for a wave which moves straight 
down the lines of induction; then H,=0, and hence =O also. Formally, 
Case 1 may be subsumed under Case 2, since a=O if H,=0. In both cases, 
(4) holds, but in the former, it is vacuous, while in the latter, it gives the connec- 
tion between vorticity and electric current density mentioned in the first sentence of 
this paper: [ew] and [j| point in opposite directions if UH,,>0, in the same direction 
if UH, <0, while theiy magnitudes stand in the ratio u|H,,\/0|U]. 

While the known results substantiate the foregoing statements, we cannot 
point to a published proof for them in this generality. Such a proof is so easy, 
however, that we omit it. 

In this paper we show that (4) ¢s a universal relation, valid for all kinds of 
magneto-hydrodynamic singular surfaces, whether the fluid be compressible or 
incompressible. Thus the connection found between spin and current is a general 
one. The relations (5) and (6), and consequently also the speed U, will be seen 
to need modification when compressibility is taken into account. 
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Kinematical formulae. For any singular surface as defined above we have 
the identities of Huconiot and HADAMARD, which may be written as follows?: 


[gradu] =n(a+Uen), [uj] =—U(a+Uen), 
[div u] = Ue, [grad 0] =oen, (9) 
[w]=nxa, Ci xe) 


where t is the acceleration, and where e, the strength of the condensation, is 
defined by (3),;. The vector @ is tangent to the singular surface; the last two 
equations give an immediate interpretation for this transversal vector in terms 
of the jump of vorticity. ‘ 


Preliminary reductions. By the above definition of a singular surface, [7]=0. 
Since dissipative mechanisms are neglected, 7=0, where the dot denotes the 
material derivative. Hence [7]=0. It follows by HApAmarp’s conditions® 
that if U +0, then [grad 7]=0. Hence 


[grad p] = [grado] + (2°) [grad], 


(10) 
Cr orem. 

(While presentations by engineers and physicists usually assume that 7 =ccnst., 
such an assumption is unnecessary, as has been known, in principle, since HuGo- 
niot’s day (1885).) 

Second, since H is solenoidal, it follows from WEINGARTEN’S theorem !° that 
[grad H] is transversal; in other words, all magneto-hydrodynamic singular 
surfaces are current-sheets, and 


[grad H]=nB, [H]=— UR, 
[f]=nxB, B=—nx{j], 


where 8 is tangent to the singular surface. The second of these formulae follows 
from the first by HADAMARD’s conditions !°, and the last two interpret the vector 
B in terms of the jump of electric current density. 

Main calculation. When resistance is negligible, FARADAY’s law asserts that 
the lines of induction are material 1": 


(11) 


H —H.gradu+ Hdivu=o0. (12) 

An equation expressing balance of linear momentum for an inviscid magnetized 
id ic 12 

oe oie + grad +pjxH + of =o. (13) 


9"(1958,.3; Eq. (9-5). 

10 (1958, 3, § 4]. 

11 This form appears in the earliest magneto-hydrodynamic researches of Cow- 
LING; its implications are discussed more formally by WaLEN [1946, § 3.4], TRUES- 
DELL [1950, 4], Lunpquist [1952, §IV], Cowrine [1957, 1, § 1.31]. This analogy 
between the induction in a magnetized fluid and the vorticity in a non-magnetized 
fluid is not to be confused with the connection, emphasized in the present paper, 
between vorticity and electric current density in the same fluid. 

12 ArFvEN [1950, 1, Eq. 4.2 (5)], ELsasser [1950, 2, Eq. (41)], and many later 
authors. 

Arch, rational Mech. Anal., Vol. 5 290 
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From (12) and (43), using (14), we see at once that 
UB—H,(a+Uen)+(H,n+H-,) Ve=0, 
—oU(a+Uen)+ocen—p(nxP) xH=0, 


(14) 
where we have assumed the body force f to be continuous. Since —(n xB) x H= 
(8 -H,)n—H,8, (14); and the tangential component of (14), become 

H,a—UB+UcH,=0, 
—90Ua—wH,B=0. 


(15) 


It is easier to interpret these equations if we rotate all vectors through a right 
angle. To the cross products of (15) by m, simplified by use of (8)3, (9);, and 
(11)3, we subjoin the normal component of (14),, thus obtaining the definitive 
system: 


o Ulw) + nH, [i] =0, (16) 
H,[w]+Uij]=—eUH,, (17) 
oe(U?— 2) =—xlj)-H,, (18) 


valid for all singular surfaces. Every quantity occurring has an immediate 
physical interpretation. 

(16) is the same as (4); 7 7s the universal relation between spin and electric 
current density announced above. The interpretation we have given for it in the 
case of an incompressible fluid was so worded as to remain valid for compres- 
sible fluids also. (47) and (18) show how (5) and (6) are modified by the effects 
of compressibility!%. The variety of possible singular surfaces is greater. 

Classification of singular surfaces. Nothing has been cancelled in deriving 
(16), (17), and (18), which are all the conditions holding. As we shall see, the 
possible wave speeds U are determined by them. We assume that o>0, c?>0, 
H-=+0, and we shall tacitly discard cases when the only solutions are e=0, 
(w]=0, [j]=0, 7.e., when all quantities are actually continuous. 

Case 1. e=0: The surface is a vortex-sheet. Then by (18), [j] is parallel 
to H. if H, +0. 

Case1a. If H,=-0, e=0, then [2e] is also parallel to H, if H-=£0, and 
U?=a?. If H-=0, [j] may be arbitrary, but again U2=a?. Thus there are 
propagating vortex-sheets, purely transversal waves which propagate at ALFVEN’S 
speed, carrying jumps of vorticity and current density connected by (4). If the 
wave 1s moving straight down the lines of induction, the jump in current density may 
be arbitrary both in magnitude and in direction; if the wave is inclined to the 
magnetic field but not normal to it, the magnitude of the jump of current density 
may be arbitrary, but its direction must be that of nx H. 

Case 1b. If H,,=0, e=0, the system (16)—(18) becomes 9 U[w]=0, U[j]=0 
[4] -H,_=0. Therefore, since H, =£0, [j] is parallel to H, but U=o, and [eo] 
is arbitrary. Thus a vortex-sheet tangent to the lines of induction is necessarily 

18 ‘We may write (18) in the form 

@=geU*+u[j)- HH, 


which may be reconciled formally with (6) if we suppose «U2 
compressibility become negligible. 2 ye itn ate eae 
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material; it can carry any jump of vorticity; the jump of current density must be 
parallel to H but is arbitrary in magnitude. Since a=0 when H,,=0, this case 
may be regarded as a limiting case of the former one. (The greater variety of 
material vortex-sheets possible if the fluid is incompressible follows from the 
fact that wis then restricted only by (6), while in a compressible fluid a= 9 ec?, 
so that. @=-Oaf 6-03) 

Case 2. e=-0: The surface is a sonic disturbance. If H,=0, from (18) we 
now get U?=c?, while from (16) and (17) we see that either U=0 or U?=a?. 
This is a contradiction unless a?=c?. Hence follows a major difference between 
the compressible and incompressible cases: No sonic wave can propagate straight 
down the lines of induction except when a?=c?, that is, 


(eo) #2. (19) 


For an ideal gas, this relation reduces to yb=mH?. Such a delicate balance 
between the static and magnetic pressures would be extraordinary indeed, and 
henceforth we exclude it. 
We take the scalar product of (16) by H,,H_, of (17) by @UH_, subtract, 
and eliminate [j]-H, by (18). Thus we obtain an equation determining the 
: 2 
speeds of propagation: (U2 a) (U2 8) =U yu A 05 


? 


Q 
where a factor ¢ has been cancelled. 
Case 2a. U=0, e+0: The singularity is a material condensation. Then 
from (20) we see that a?=0; therefore H,=0. The system (16)—(18) reduces 


to one member: Ob — 119) oH. (21) 


This case includes Case 1b in the limit when ¢—>0, and it includes the limit case 
of Case 1a obtained when a->+0. For arbitrary «, summary of Cases 1b and 2a 
shows that every material singular surface is tangent to the lines of induction; 
the most general such surface may be decomposed into two: 

(i) A vortex-current-sheet carrying arbitrary jumps in w and j. 

(ii) A surface carrying a longitudinal condensation « determined by (21). 


That a jump of current density which is inclined to the magnetic field gives 
rise to a material current-sheet which is the seat of a condensation is perhaps the 
most remarkable effect of the interaction between compressibility and magnetism. 
If the jump of current density lies in the half-plane to the left of H, the conden- 
sation is positive; if in the right half-plane, negative. Thus a suitable tangential 
flow of current can render a sheet of induction a material boundary permanently 
separating a region of dense gas from a region of rare gas. This seems to be 
the principle which makes possible a magnetic wall. 

Case 2b. H,=0, U=+0, e=F0: The surface is a propagating condensation 
always tangent to the lines of induction. Since a=0, (20) yields 

le He jiplek 


Ua= eo = C+ 3 (22) 
Q Q 


and the system (16)—(18) reduces to the form 


[wy=0, [fJ=—eH_, ei =— [9] H_, (23) 
29* 
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so that these waves are irrotational; they carry a jump of current density which 1s of 
magnitude |eH| and is in the direction opposite to enxH; they travel at super- 
sonic speed. As H 0, these waves pass continuously into ordinary waves of sound. 

The cases considered so far are special and in some way degenerate. We 
come now to the general case, when the universal relation (4) assumes its full 


power. 

Case 2c. H, +0, U-+0, e+0. Two pairs of speeds of propagation are de- 
termined by (20), an equation the nature of whose roots has been determined 
by Frrepricus & Kranzer. Without using their results we see from (16) that 
now neither [j] nor [av] may vanish, and that these vectors are parallel to each 
other; by (17), each is parallel to H_. The general solution of the system (16) 
to (18) may be written in terms of a scalar parameter 0: 


(jiSdo(mx), 


= My 0 
[ww] = aie (nx), (24) 


Let us think of the condensation ¢ and the magnetic field H as given. H and 
the wave-normal n determine two pairs of roots Uj, and U%,, of (20); as 
shown by FrIEDRICHS & KRANZER *, U},>>a® and U5,,<a?; also Ui,y>c? 
and U2.,<c?. By (24)3, then, 6 has the same sign as e for the slow waves but 
the opposite sign for the fast waves. By (24) we see that for given values of 
H,,, 2H, and « there are exactly four possible waves. Summary of the imph- 
cations of (24) yields the main theorem: Aside from the special circumstances 
in which (17) holds, no wave carrying a condensation can propagate down the lines 
of induction, but any other direction of propagation 1s possible. In the general 
case, when the wave-front is not tangent to the lines of induction, there are four 
possible waves, deternuned uniquely by H, n, and e. One parr travels at supersonic 
speed; the other, at subsonic. The jumps of current density and vorticity are 
parallel ton XH. The sense of [Jj] is the same as that of nXH for a slow positive 
or fast negative condensation, the opposite in the opposite cases. Both members of 
one pair carry the same condensation and the same jump of current density; they 
differ only in the sign of U, which is either + or — at will, and in the sense of [w, 
which 1s opposite to that of [j] or the same, according as Un-H>0 or Un-H<0. 
In every case, the magnitudes of all jumps stand in the ratios 


“Ai, 


Lf1 01]: | = [erg aE | 


0 |U| 


1). (25) 


“4 FRIEDRICHS & KRANZER have ‘‘>” and ‘‘<”’, but the signs of equality are 
not possible in the present case. U?=a?+0 yields H; =o by (20); hence by (18), 
since ¢=£0, it follows that U2=c?. Likewise, U2=c? imphes that U?=a?. Thus c?=a?, 
but this is the case, excluded here, when (17) holds. 

For completeness, we add a proof of the results stated in the text. The case when 
a®=c® is excepted for the reason just given, so that Ho: also a?>0 and c?>0. 
It is then immediate from (20) that U2—a? and U2—c? have the same sign and do 


not vanish. Since Ufgst Ujow=@?c?, from U},4>a? it follows that U2,,<c2. Q.E.D. 


Magnetosonic Waves 439 


As H->0, the fast waves go over into ordinary waves of sound, longitudinal 
waves carrying an arbitrary condensation in an arbitrary direction and propa- 
gating at speed --c, while the slow waves become indeterminate material singu- 


larities. 


1960, 1. 


As e->0, both kinds of waves go over into ALFVEN’s waves. 
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Stress Tensors in Elastic Dielectrics 


R. A. TOUPIN 


1. Introduction 


In a previous paper [1] we presented a theory of the finite deformation of 
an elastic, electrically polarizable material. As originally presented, the theory 
divides a symmetric total stress into two asymmetric components which we 
called the local stress and the Maxwell stress. Such a decomposition is not 
necessary but is made for the purpose of introducing a constitutive relation 
between the local stress and the deformation and polarization of the material. 
It is the purpose of this note to clarify the theory presented in [7] and to exhibit 
a number of equivalent formulations of its basic equations. In these equivalent 
formulations, a variety of ‘‘stress’” tensors appear, and we shall attempt to make 
clear the relations between them and the stress tensors introduced in [J]. 


The problem of elastic dielectrics affords a simple model of a classical field 
theory based on a variational principle. Results on this model serve as a guide 
to the more complex and general dynamical theory of the electromagnetic field 
in a moving and deforming material which must contain the present statical 
theory as a special case. MINKOWSKI[2] and ABRAHAM [3] many years ago 
proposed different expressions for the electromagnetic stress, energy, momentum, 
and energy flux in material media. We do not cite the extensive literature which 
supports one or the other of these expressions. As emphasized by MOLLER [95] 
and GyorGyI [4], in a material medium, the electromagnetic field forms only 
part of the physical system. Any division of energy, momentum, stress, and 
energy flux into electromagnetic and mechanical components is bound to be 
somewhat arbitrary, and it is fruitless to attempt an independent theory of 
either component. We trust that the considerations to follow will illustrate 
these ideas concretely in a simple case where the physical principles are easy 
to grasp and intuition is trustworthy. 


2. Some preliminary formalism 


a) The divergence theorem. If j' is a contravariant vector density, 1.¢., a 
relative tensor of weight 1, then its natural divergence is a scalar density *. 


divf=af, a= 2. (2.1) 


t Biren 
ox 


i The natural divergence is defined independently of any definition of covariant 
differentiation, In metric or affinely connected spaces with symmetric affine connec- 
tion, V; f' = @;f' provided f is a density. Cf. ScHoUTEN 16 Che this 
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If + is a surface enclosing a region » where f is continuously differentiable, 
then 


§ fids,—f divfde, (2.2) 


which is the divergence theorem [6, p. 97]. 


Suppose # is a region divided by a surface 4 across which f has a discontinuity 
[}] = ¥*— f where f* and Fare the limiting values of f on 6. Then if f satisfies 
the smoothness conditions of the divergence theorem in the two parts of », and 
if the limits {* and f- exist, we have 
$ fi'ds,= f divide + f [f]4s;, (2.3) 
6 Uv ONY 
where d6; points into #*. 


b) Motions. Let . 
x = 0 (X4, a), X4 = XA (x' 2) 

(2.4) 

ox" 

CG 


+0 


(x/X) = det | 


be a one parameter family of non-singular continuously differentiable mappings 
(point transformations) where the indices 7,7,... A, B,... have the same range 
1,2,...,”. In the applications, the variables X4 will be material coordinates, 
the x’ will be spatial coordinates, and m will equal three. We may think of / 
as the time though this is not a necessary nor even desirable interpretation in 
purely statical considerations. It is preferable to regard A simply as an index 
set which labels the configurations of a given set of material points {X}. In 
these preliminaries, we need not assume that the dimension of the space is 3 
nor that the space with coordinates x‘ is metric in the sense of Riemannian 
geometry. This will allow application of the general formulae to be derived to 
problems other than the particular problem of elastic dielectrics considered in § 4. 


We set 


é 4 Sy 7A 
y — Ox x4 => OX 
rely ee Ohne eo 5) 
geno ee: 
x4 = ax4 ) Cea ax! 
and record the identities * 
8 3 x es 
oS — ide x4 = — Xf yv’, 
xi, Xf = d;, Ge aa we 
6 (x/X) ae (a/X) Dee 0 (x/X) ne (ae/X) xi, 
ax! ‘ ax 
A i 
Ave ax! aie 
: — — X4 xP, a hp ey 
Ox’ OX; 


c) Invariant derivatives. Let /;/" (a, 4) denote a tensor field of arbitrary 
rank and weight whose components may be functions of the parameter 4 as 


* Cf, TRUESDELL [7]. 
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well as the coordinates a. Let 


(2.7) 


The Lie derivative of f with respect to v’ is a tensor field of the same type 
as f with components given by* 


v 


where w is the weight of f. 

Any set of quantities f??, Q=1, 2,...,N which transform by a linear law 
with coefficients determined by the transformation and its derivatives is called 
a geometric quantity [6, Ch. II, §3]. The definition of the Lie derivative may 
be extended to arbitrary geometric quantities and has the general form ** 


£0 = 0" On f — FG 5 Oy UF, (2.9) 


where the F?” are constants determined by the law of transformation of the /*. 
Formula (2.8) is a special case of (2.9). For example, when f is a contravariant 
vector density, Fi%,=6/'6;, — 6;6%,. The notation of (2.9) is that used by BERG- 
MANN [8]. 

In a space with symmetric affine connection Ey and when /* is a set of tensor 
components, we have 


Vf = 0, (9 + FRA ie, (2.10) 


where the F?” will have the same values as in (2.9) for corresponding tensors 
or sets of tensor components /”. 


In affinely connected spaces we define the material derivative of f by 


f= pf? +o"D,, 72. (2.14) 
The convective A derivative of f is defined by 


* \ 


Papter (2.12) 


In a space with symmetric affine connection, the partial derivatives of v’ 
and f* which occur in the definition (2.8) or (2.9) of the Lie derivative may be 
replaced by covariant derivatives [6, p. 152], 


EU Fae i (2:43) 


provided, of course, that /? is a tensor field or set of tensor fields for which the 
covariant derivative is defined. We then have the identity 


f? = f? — F920, 0" f4. (2.14) 


A geometric quantity /° together with its first partial derivatives 0;/° is a 
geometric quantity (f°, 0,f°). In the sense explained by SCHOUTEN [6, p. 105], 


* 16Ch. 1, $10]. 
BRC]. YANO 120], 
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Lie differentiation and ordinary partial differentiation are commutative: 
£0, PP = 0,87. (2.45) 
Thus, so also do the operators 6; and (*) commute: 
5 Pa ape. (2.16) 
However, the Lie derivative of an affine connection is given by 
£ Tin =V,V,,0" + Ri 0, (2.17) 


where R is the curvature tensor based on I and we assume Tin=0. Taking 
the Lie derivative of equation (2.10) and using (2.15), we get 


£V,P VP =Feae Tn, (2.18) 


from which it is apparent that, even in flat spaces where Rj;},",=0, Lie differentia- 
tion and covariant differentiation do not commute. It follows that covariant 
differentiation and convected 4 differentiation do not commute: 


* 


Seen * 
Vie? Vil eae ae (2.19) 


As a consequence of the identities (2.6) and the definitions (2.9) and (2.12) 
we have the useful identities, 


RE=XK=0, XF=0, #=0, (2.20) 
where the fields X* are treated as absolute scalars. 
It follows from (2.20) that if we define the scalars F'?~ by 


agi l CAD. G\ LW C7, Guee oe nt a (2121) 
then 
Tne loge TX ee (2:22) 


fi” = |(a/X) 


where f is a tensor of weight w, and 


pigrest( (Gir rs > nme @ ee eaters (2.23) 
since, for scalars, F —F. If the Fé? are expressed as functions of the material 
coordinates X“ and of A, then F= ae It is easy to see that the formulas (2.21) 

oO + 


to (2.23) can be written in the abbreviated form 


-1 
2 _ 42 ES, F4— A4 ee 
f ov i. (2.24) 
pP=A*, 4 


where the coefficients A®, are certain rational functions of the win (2.24), 
the / may also stand for the ordered set of components of more than one tensor 


field. 
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d) Lagrange identities. The field equations which follow from a variational 
principle are determined by assigning a scalar density Y as a function of the 
components (f%, ;/%, 0;0;f%, ...) of a geometric quantity: 


Pe a Pr (2.25) 
The condition that Y transform as a scalar density, 7.¢., 
LT =| (laa? (2.26) 


under general transformations of the coordinates # leads to a set of algebraic 
and differential identities satisfied jointly by Y, 0L/8/*, 0£/0(8f"), eee rs 
0,f°, ... as shown by ScHouTEN [6, Ch. II, §41]*. We restrict attention here 
to the case where ¥ depends only on the fi and their first partial derivatives 
@,f°. This implies no loss in generality since the geometric quantity ibe af”, 
..., 8; 0;,... 6;,f%) can always be regarded as the geometric quantity (e702 


here 27 = (72, Be, cig Oe Oe Cn) ton Owever, an ibe applications, the 
geometric quantity /* which appears in (2.27) below is assumed to have a tuee 
homogeneous law of transformation corresponding to the special case where / 

stands for the ordered set of components of a set of tensor fields. Thus we assume 


that FY has the form 


L= Lf, 0,f). (2.27) 
We then have A a 
e then h $= 4h 2) = 78h + oO eh (2.28) 


which must hold for arbitrary v*. In (2.2 
{?=@,f°. By equating coefficients of v*, 2,, 
we get the identities **: 


8) and henceforth we use the notation 
v®, and @,,0,v" on both sides of (2.28), 


nt at ger lee (2.29) 
5 OL eae OL a iA 
jt= alt Elsah tsa), (2.30) 
0 = 2 Foo. (2.31) 
org, 


In field theories based on a variational principle with Lagrangean ¥ the set of 
quantities, : : a 
ct = GL — Be ‘ (2.32) 


v 
is called the canonical stress matrix. In dynamical theories where the underlying 
space of points # includes one dimension of time, ct’; is called the canonical 
stress-energy-momentum matrix. Only in special cases will ct’; transform as a 


mixed tensor density of weight 1 under general transformations of the coordi- 
nates x’. 


* Cf. also THomas (9, Chs. VI, VII]. 

** The non-trivial identities (2.30) and (2.31) were obtained by EINSTEIN [13] for 
the case when /® is a symmetric, non-singular, second order absolute tensor. The 
generalization of these identities to an arbitrary geometric quantity /° upon which 
the remaining identities of this section rest was given by BERGMANN [8]. Various 


special cases of these identities are well known in differential Peometi yn See we-l5, 
THomas [9] and ScHouTeN [6]. 
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On computing the natural divergence of ct’; and using the identity (2.29) 
we find that 


? 


6; (ct’,) LG Be (2.33) 
where La is the Lagrange derivative of FY with respect to /* defined by 
Lo= erat Qe: 
Q aj? 1 af? ( 34) 


The Lagrange derivative of # with respect to a tensor of weight w is a tensor 
of weight 1—w. Integrating the last term in the identity (2.30) by parts, intro- 
ducing the Lagrange derivative (2.34), and making use of the identity (2.31), 
we get the identity 
i Qi n (OL pai 
ot Sg Fai — O(a Fast). (2.35) 
Ort 

If one now computes the divergence of ct’; the last term contributes nothing 
because of its antisymmetry in the indices 7 and k and one gets using (2.33) 


Loff + 6;(Le FR; f") =0, (2.36) 


which is BERGMANN’S zdentity [8]. Equivalently, if we put 


qi i i fd OL yaoi = 
B= ty t Lo Fai = — &( Fail), (2.37) 


[k 


we see that (2.36) is equivalent to the identity 
0,2', = 0. (2.38) 


Given any scalar density function of a geometric quantity /® and its derivatives 
/%, formula (2.37) constitutes a general rule for the construction of an array 
of m? quantities whose divergence vanishes zdentically as a consequence of the 
assumed law of transformation for Y. The /* need not be a set of tensor fields 
but may also include more general geometric quantities such as an affine con- 
nection. 

Special interest is attached to the case where one of the quantities included 
in the set /® is a symmetric, non-singular tensor g,,,, Which we may think of as 
the metric tensor for the underlying space, though this is not a necessary inter- 
pretation for the present analysis. In this case, we indicate the functional de- 
pendence of # by writing 


L = BA (iio O;, Smn> hae Pe) (2.39) 


and separating the tensor g,,,, explicitly from the total set /®. If /* is a geometric 
quantity, the remaining set of components g° is also a geometric quantity 
whose Lie derivative we denote by 


£ oy =v" ag? — DY" 0,0! pe. (2.40) 
We introduce the special notation t”” for twice the Lagrange derivative of @ 


with respect to g,,,,: 
mn OL 6L 4 
Laimae la Ct Ge Ben) ] ; eee 
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BERGMANN’S identity (2.36) applied to a Lagrange function having the form 
(2.39) yields TRAUTMAN’s identity [1]: 


V,t', = Lo oF + 8,(Lg 24} 9"), (2.42) 


where = ir t'® and V, denotes covariant differentiation based on the Christoffel 
symbols of the tensor g,, ,- 

The Lagrange derivative of Y with respect to an absolute tensor field is a 
tensor field of weight 4, 7.e., a tensor density. For reasons to appear later in the 
application of these general formulae, we shall call the symmetric tensor density 
t'? defined in (2.41) and the associated absolute tensor ¢’7=t'//|/|det g,,,,|, the 
Cauchy stress. From (2.42) we see that a sufficient condition for the covariant 
divergence of the Cauchy stress to vanish is that Lg —0, 1.e., that the Learnee 
derivative of Y with respect to each of the remaining field components 
vanish. From (2.33) it is apparent that L,=0, which, in the case of a Lagrangean 
having the form (2.39), includes the condition t””=0O, is a sufficient condition 
that the ordinary divergence of the canonical stress matrix vanish. We have 
previously noted that the ordinary divergence of the tensor qi, vanishes 7denti- 
cally. 

Henceforth we restrict attention to the case where ¥ is independent of the 
partial derivatives of g,,,: 


A (ac BAC eee (2.43) 


When # has this form we can solve the identity (2.30) for the Cauchy stress. 


i i ae , CROP oe 
t= t+ 984( 2 go” 4 7): (2.44) 


Now, by definition, the Cauchy stress is symmetric. 


l= 0, (2.45) 
which, by (2.44) implies that 
i 2p ALO Ry ie aay 
of + 2S dal G: daraes vm | = 0 (2.46) 


Thus, in general, the canonical stress ct'’=g’* ct, is not symmetric. Equation 
(2.46) is an explicit formula for its antisymmetric part when the Lagrangean has 
the special form (2.43). 


3. Field equations and boundary condition of a variational principle 
Let Y have the special form (2.43) and consider the integral 
Shel old Oat) 


where 9* =" (a, 4) but the metric components g,,,,(a@) are assumed to be in- 
dependent of 2. The region of integration #(A) in (3.1) corresponds to a fixed 
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set of material points X* and thus depends on the parameter 4. Let #(A) be 
divided by a surface of discontinuity (A) where we assume that (A) corresponds 
also to a fixed set of points X*. The metric will be assumed continuously differ- 
entiable throughout 2 (A), but the fields y® will be assumed continuously differ- 
entiable only in each of the regions »* and »~ and may have ordinary discon- 
tinuities at Y(A). Under these conditions wé have 


Cel CS (RR ee 
aa} ?t fe Emn I Age I age Gn)ae. (3.2) 


Since 9% = 8,9, and Sn —= On 0" Sn +O," Syp, We can integrate certain of the 
terms in (3.2) by parts and apply the divergence theorem (2.3) to get 


aa={e Vin Mo + Le G?) do + H(t ve ay Od, — 
6 


ah we Pm ( ) 
33 
ie lee 
Srv OPn 


Let the derivatives (variations) v’ and gy be subject to a set of N constraints 


0=C°=a%v' + 0% @", w=1,2,...,N. (3.4) 
By introducing the Lagrange multipliers 7,, in the usual way we obtain the 
result : 

A sufficient condition that the integral I(A) have a stationary value for arbitrary 
variations (v*, p®) consistent with the constraints C?=0, vanishing on the boundary 
5(A) of (A), and continuous at S(A)=0, 1.e., [v*] =0, [p°]=0, ts that the field 
equations 


(3.5) 


be satisfied at each interior point of the regions »* and vw and that, at the surface 
of discontinuity SF, the boundary conditions 


[tJ Nn — No a”, =0, 


aL a 
a | Mm + No Oa = 0 
OPm 


(3.6) 


be satisfied at each point of F(A) interior to v (A). 
Remarks. Suppose that the g® are the components of a set of tensor fields 


and introduce the scalars OYA", gy” asin (2.24). Then one has gp” =A*y, OY” 
and from (2.6),, v’= —x%, X“. Thus the assumption that the variations (v®, p) 
are continuous at (A) is seen to be equivalent to the assumption that the 
variations (D”, X4) are continuous at (A) provided one assumes that [A ]=0 
which holds when the X4 are continuous. A different set of boundary conditions 
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arises if we write dI/d/ in the form given by WEIss [12], 


dh : ou s 
= | (ie 4 Wises Ld, a L\aL, 


(3.7) 


= | ls Hal Bee ae 


farang eer 


Y 6 Maia 


I 


ge ae Big I} aS,, 


where, in the last equation of (3.7), the variation ¢” = g® + v*é, y® is not a 
tensor unless all the y® are absolute scalars and therefore, in general, has no 
particular geometric significance*. One then sees from (3.7) that a sufficient 
condition that J(A) have a stationary value for all variations (~®, v*) which 
vanish on 4 and are continuous in # (A) is that Ly =O in #* and # and [ct';]n;=0, 


| A 2 | n,=0 on Y(A), provided there are no constraints on the variations (po, v"). 
or 


4. Elastic dielectrics 


The formalism of §2 and §3 will now be applied to a special case to reproduce 
the equations of the theory of large elastic deformations of a solid dielectric 
material considered previously in [J]. 

Let the space of points # be Euclidean three-dimensional space, and let the 
Lagrangean function have the form 


L = B(X*, XE, Be, g;;) + BA — (€0 Va/2) 87 8,9 9; 9 (4.1) 
where g;; are the components of the Euclidean metric tensor, g is an absolute 
scalar field called the electric potential, JS’ is a contravariant vector density with 
the transformation law, 2 

aa 

P= | (a pe) oy, (4.2) 
called the polarization density, and YW is a scalar density we call the energy density 
of elastic deformation and polarization. Note that YW is independent of @ and its 
derivatives, the central term in (4.1) is independent of the deformation gradients 
X#, and the last group of terms in (4.1) depends only on g;; and 0;@. €)is a constant, 
and, as usual, g denotes the determinant of the metric. Since we assume that 
the metric is Euclidean, there exist coordinate systems, the rectangular Carte- 


pian frames, for which &:;—0;; everywhere. However, it is to be noted that 


0;;+0 except for rigid variations. In some works on electromagnetic theory it 
is ewe to represent the polarization field by the absolute vector field 
eas. ‘| \/q. Actually, it is more convenient to work with the vector density $8 


ts WEISS [Z2, p. 105] states that the @®’ are ‘‘more fundamental’ than the varia- 
tions y® and gives a reason we do not understand. The variations pas on the other 


hand, have an invariant significance even in non-metric or non- -affinely connected 
spaces. 
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which has the transformation law (4.2). If one restricts the coordinate trans- 
formations to the unimodular ones, no distinction need be maintained between 
tensor densities and absolute tensors. The scalar density YW is related to the 
absolute scalar function 2» introduced in [7] by 


W = Qo (x/X) 7 2 (4.3) 


where @ is the density of mass in the undeformed or natural state of the material. 
A material characterized by the field equations and boundary conditions (3.5) 
and (3.6) with a Lagrange function having the form (4.1) will be called a perfectly 
elastic dielectric provided the constraints do not include the condition %—0. 
The canonical stress in an elastic dielectric is given by 
eee re ay 
= 6; £ aor ; axe X; 
i ow i oi i ae 
= hears E;+e|/gF E; oe oj(B- E+ /sE*), 
where we have introduced the electric field, E;= — 0,9, E'= giE,, %.B=f'E,, 
etc. 
Now in [1] we introduced the absolute scalar field x’ defined by 


a! = (a/X) B'/oo (4.4) 
and wrote 2’ in the form 
eK Age (4.5) 
We then introduced a local stress tensor te, and a local field E,; defined by 
A ey io ee (4.6) 
Ox", On 


= |(a/X)|+ 09 is the density of mass in the present configuration. From 
(4.2), (4.4), (4.5), (4.6) and the identities given in § 2 we derive the identities: 


Ce nt eee 
o (4.7) 
ans — i LE;. 
oe 
Substituting these relations into (4.3), we get 
cts =u, + BE; + & gE’ Ej — 6; (E+ LE) 8 — (£0 9/2) HE, (4.8) 


which exhibits an explicit relation between the canonical stress and the local 


stress. 
For elastic dielectrics, the symmetric Cauchy stress as given by (2.44) turns 
out to have the form 


4 a aN 4 9n Of m 
By ct + (0m — 970m) x ) 


ae (4.9) 
= ct, — (Ej; +E) B+ 5 (E+E): 8. 
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Substituting from (4.8) for the canonical stress in (4.9), we get 
. . . — . — vi Of 
f=, = Ep Be |g E' E; — (&|/9/2) 6; 2? = Na ia (4.10) 
om 


The field equations and boundary conditions are obtained from the general 


e 
formulae of §3 as follows. First note that XK =o, X#4=0, so that the constaints 
(3.4) corresponding to these conditions have a simple form. We then consider 
a region (A) filled with elastic dielectric which might consist of two disimilar 
materials with the common boundary “(A). In #* and » we assume the varia- 


tions i and @ are arbitrary. We then get the field equations and boundary 
conditions: 
Vit, ==i0) 
Le 2 (4.11) 
é9 \/9 V? p— div $=0, 


which must hold at each point of * and #, and 


[tilm,=0, [eo Voz +81 2,=0 (4.12) 


which must hold at each point of Y contained in ». 


When the field equation (4.11), is satisfied, the Cauchy stress tensor (4.10) 
reduces to the form 


t= ty +E; B+ (eo 9/2) 5 (4.13) 
given in [1]. Thus the field equations and boundary conditions (4.11) and (4.12) 
are equivalent to those of [7]. When the field equation (4.11), is satisfied, it 


follows from (4.9), that the Cauchy stress and the canonical stress are equal 
in elastic dielectrics. 


From the symmetry of the Cauchy stress and (4.10) we conclude that 
tla a Eu yy] =0, (4.14) 
which is equation (10.36) of [7]. The derivation of (4.14) given in [7] shows 


how this identity follows from the invariance of X’ under rigid motions. 


It is natural to introduce still another stress tensor in elastic dielectrics 
which we define by 


MS ee (4.15) 
Let us call yt'’, the material stress. By (4.14) it is always symmetric. Thus 
the Cauchy stress can be given the alternative decomposition 
{7 = gti? + mi, (4.16) 
where 
m'? = &9 |/g(E' E! — 3g"! E2) (4.17) 
is a type of Maxwell stress tensor, or electromagnetic stress tensor, which is 
symmetric and has the same form in all materials. 
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As shown in [1], a stored energy function Y which is invariant under rigid 
motions must reduce to a function of the quantities 


a= Gas TeX), 
Crg= £3; eae (4.18) 
IT* = |(a/X)| X4 Boo, 


each of which is an absolute scalar under general transformations of the spatial 
coordinates x’. One can then show that the material stress tensor yt’! is given by 


7] a o ee a A 
mt -0 OC4p X4 XB; (4.19) 


analogous to the formula of ordinary finite elasticity theory (cf. TRUESDELL 
[7, Eq. (39.2)]). 

Following this deluge of formalism and definitions of stress tensors in elastic 
dielectrics, perhaps it is best that we restate our position. The fundamental 
entity from the point of view of mechanics is the Cauchy stress ¢’? which is a 
certain function of the fields X*, X¥*, g;, 8, and g g;; fixed by the functional 
form of W%(X*, X*, f° , &i;). The field equations and boundary conditions are 
(4.41) and (4.12). renese equations are obviously independent of any decomposi- 
tion of the Cauchy stress into a mechanical and an electromagnetic component. 
Decompositions of the type (4.13) and (4.16) may be used as intuitive guides 
in the construction of admissible constitutive equations for YW but are other- 
wise irrelevant. A distinguishing feature of the decomposition (4.16) is that m’’ 
is independent of $ and of the deformation and has the same form in every 
material. It need not vanish in a vacuum where W=% =0. On the other hand, 
the material stress yt’ is a function only of the polarization ¥ and the deformation 
and depends on these variables in a way which is characteristic of the material. 

Consider next the Lagrange derivatives of # with respect to the field variables 
X*, ', and ~. Let us denote them by 


OX aw 
ee eA |; Ue WR ot Ss 
Py es xe ie 
i (4.20) 
Q= ep V/s VY?» — div. 
For elastic dielectrics, TRAUTMAN’S identity (2.42) takes the form 
V, t'; = & XF — Q, 8, P' — VE; + (6" 6, — 6} 6f') 4,(&,, B’). (4.21) 


Since det X} +: 0, it follows from (4.21) that the field equations (4.11) are equi- 
valent to the set of Lagrange equations {,;=0, 8;=0, t=0. That is, the field 
equation V;, a) may be replaced by the equation {,—0 in the set (4.11). 
Finally, we consider briefly the problem treated in [/] of an elastic dielectric 
placed in a vacuum. The variational principle of §3 still applies provided we 
take into account the constraint %W=0, =O outside the dielectric. Here we 
let the discontinuity surface (A) correspond to the boundary between the 
dielectric and vacuum. The field equations interior to the dielectric retain the 
form (4.10), but owing to the constraints $=W%W= 0, exterior to the dielectric, 
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we get the field equations § 


Vit, =0,.° ey )aV wp =O; (4.22) 
But when W=¥=0, t', has the form 
t' = & Vg(B°E; — 3 6; E*) (4.23) 


and TRAUTMAN’s identity reduces to the trivial one 
V;t', = & 9 V2 @ E;. (4.24) 


Thus the field equations (4.22) are not independent, and we may take the simple 
scalar equation (4.22), as the single vacuum field equation. 


‘ 
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